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Run on : 



Title: 

Perfect score: 
Sequence : 



September 29, 2004, 18:02:07 ; Search time 103.384 Seconds 

(without alignments) 
3219.466 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 



1586107 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1586107 seqs, 282547505 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : A_Geneseq_29 Jan04 : * 

1: geneseqpl980s : * 

2: geneseqp!990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6: geneseqp2003as : * 

7: geneseqp2003bs : * 

8 : geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAY71311 

ID AAY71311 standard; protein; 1178 AA. 
XX 

AC AAY71311; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Human neurite growth inhibitor Nogo. 
XX 

KW Human; neurite growth inhibitor; Nogo; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 



KW structural plasticity; screening. 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Misc-dif ference 187 

FT /label= Unknown 

FT Misc-dif ference 188 

FT /label= Unknown 

FT Misc-dif f erence 189 

FT /label= Unknown 

FT Misc-dif f erence 190 

FT /label= Unknown 

FT Misc-dif f erence 221 

FT /label= Unknown 

FT Misc-dif ference 328 

FT /label= Unknown 

FT Misc-dif ference 477 

FT /label= Unknown 

FT Region 977. .1012 

FT /note= "Region specifically described in claim 16" 

FT Region 994. .1174 

FT /note= "Region specifically described in claim 16" 

FT Region 1079. . 1114 

FT /note= "Region specifically described in claim 16" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 

PR 06-NOV-1998; 98US-01 07 44 6P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 11; Fig 13; 122pp; English. 
XX 

CC The present sequence is a human Nogo protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The human Nogo 

CC sequence was derived by aligning human expressed sequence tags (ESTs) 

CC e.g. AA158636, AA333267, AA081783, AA167765, AA322918, AA092565, AA081525 

CC and AA081840 with the rat Nogo sequence. Nogo proteins and fragments 

CC displaying neurite growth inhibitory activity are used in the treatment 

CC of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 



CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperprolif erative or benign 

CC dysprolif erative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to .induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1178 AA; 

Query Match 99.3%; Score 5882; DB 3; Length 1178; 

Best Local Similarity 99.7%; Pred. No. 5.4e-296; 

Matches 1175; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I M I 1 I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSVPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPT APAAGAP LMD FGN D FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 

I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I II M I I I I I I M I I II I 
Db 61 AG L S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VP AP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 18 0 

I I I I I I I I I I I I I I i I I I I I I II I 1 I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I II 
Db 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

I I I M I I I I I I I I I I I I I t I I II I II I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I 
Db 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I M 

Db 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEK7VKDSFNEKRV 360 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

II I I I I I I I I I I II I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I 

Db 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDE 480 

I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDE 480 

Qy 4 81 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYWTDNLTKVTEEVVANMPEGLTPDL 540 

I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 481 KKIEEKKAQIWEKNTSTKTSNPFFVAAQDSCTDYVTTDNLTKVTEEWANMPEGLTPDL 54 0 



Qy 



541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 



Db 541 VQEACESELNEWGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I II I I 

Db 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

II I I I I I I I II I I 1 I II I I I I I I I I I I I M I I I I I I I I I II I I I I M I I I I I I I I I I II I 

Db 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

I I I II I I I I I I I I I I I II I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 
Db 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I M I I M 
Db 781 GKPYLESFKLSLVNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVXLLPPDVSALGHTQ7VEIESIVKPKVLE 960 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

I I I II I I I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I 
Db 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I II I ! I I I I 
Db 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

Qy 1081 ALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

I I I I I I I I I I I I I I I II I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I M I I I I I I I I 
Db 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

Qy 1141 YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

I I I I M II I I I II I I I I I I I I I I I I M I I I I I I I I I I I 
Db 1141 YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 



RESULT 2 
AAU04591 

ID AAU04591 standard; protein; 1192 AA. 
XX 

AC AAU04591; 
XX 

DT 26-SEP-2001 (first entry) 
XX 

DE Human Nogo protein. 
XX 

KW Human; Nogo receptor; axonal growth; immunogen; antibody; nogo protein; 

KW cranial trauma; cerebral trauma; spinal cord injury; stroke; 

KW demyelinating disease; multiple sclerosis; monophasis demyelination; 



KW encephalomyelitis; multifocal leukoencephalopathy; panencephalitis; 

KW Marchiaf ava-Bignami disease; pontine myelinolysis ; adrenoleukodystrophy; 

KW Pelizaeus-Merzbacher disease; Spongy degeneration; Alexander's disease; 

KW Canavan's disease; metachromatic leukodystrophy; viral infection; 

KW Krabbe 1 s disease. 

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Domain 1054, .1119 

FT /label= Lumenal_extracellular_domain 

FT /note= "This sequence is specifically claimed" 

FT Peptide 1055. .1094 

FT /label= Pepl 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1064. .1088 

FT /label= Pep2 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1074. .1098 

FT /label= Pep3 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1084. .1108 

FT /label= Pep4 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

FT Peptide 1095. .1119 

FT /label= Pep5 

FT /note= "Receptor binding inhibitory peptide. This 

FT sequence is specifically claimed" 

XX 



PN WO200151520-A2. 
XX 

PD 19-JUL-2001. 
XX 

PF 12-JAN-2001; 2001WO-US001041 . 
XX 

PR 12-JAN-2000; 2 000US-0175707P . 

PR 26-MAY-2000; 2000US-0207366P . 

PR 29-SEP-2000; 2000US-0236378P . 
XX 

PA (UYYA ) UNIV YALE. 
XX 

PI Strittmatter SM; 
XX 

DR WPI; 2001-442138/47. 

DR N-PSDB; AAS09453. 
XX 

PT Novel Nogo receptor protein useful for identifying modulator of Nogo 

PT protein or Nogo receptor protein, which is useful for treating central 

PT nervous system disorders. 
XX 

PS Example 1; Page 101-104; 109pp; English. 
XX 

CC The sequence is the human Nogo protein, a 250kDa myelin-associated axon 



CC growth inhibitor. The invention relates to the use of the nogo receptor, 

CC nogo protein, their nucleic acids, vectors expressing them and antibodies 

CC against them, to isolate agents which block nogo receptor mediated axonal 

CC growth. The agent is useful for treating a central nervous system 

CC disorder which is a result of cranial or cerebral trauma, spinal cord 

CC injury, stroke or a demyelinating disease selected from multiple 

CC sclerosis, monophasis demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis , Marchiaf ava-Bignami disease, 

CC pontine myelinolysis , adrenoleukodystrophy, Pelizaeus-Merzbacher disease, 

CC Spongy degeneration, Alexander f s disease, Canavan 1 s disease, 

CC metachromatic leukodystrophy, viral infection and Krabbe ' s disease 

XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 4; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.6e-292; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I II I II I I I I 1 I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAP PVAPERQPSWDPS PVS STVPAP 120 

I I I I I II I I II I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I 
Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAP PVAPERQPSWDPS PVS STVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I II I I I M I I I I I I I I I I I I I I II I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

IN: I : : I I I I I I I I I I I I I I I M I I! I I I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I II I I I I I I M I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I II 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTT^TKLVKEDEW 346 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDSKEDSDMLAAGGKI ESNLES KV 406 

I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 61 S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

Ml I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIWEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I II I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 



Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I I I I I I I I I I I I I I I I I I II I I I I 1 I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 1 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I II I I I M I I I I I I I I I I I I I I t t II I I I I I I t I II I I I I I I II II I I I II 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I I I I I I I I I II I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 7 66 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 82 5 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVST^LGH 945 

I I I I M I I I I I I I I I I I I I I i II M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I : I I i I I I I I I I I I I I I I I M 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 107 9 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I II I II I I I I I I I I II I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 3 
ABP68600 

ID ABP68600 standard; protein; 1192 AA. 
XX 

AC ABP68600; 
XX 

DT 14-JAN-2003 (first entry) 
XX 

DE Human pancreatic cancer expressed protein SEQ ID NO 71. 
XX 

KW Human; pancreas; cancer; gene therapy; vaccine; immuno stimulant; 

KW cytostatic; tumour. 

XX 

OS Homo sapiens. 
XX 



PN WO200260317-A2. 
XX 

PD 08-AUG-2002. 
XX 

PF 30-JAN-2002; 2002WO-US00278 1 . 
XX 

PR 30-JAN-2001; 

PR 31-JAN-2001; 

PR 09-FEB-2001; 

PR 21-MAR-2001; 

PR 28-APR-2001; 

PR 16-MAY-2001; 

PR 12-JUL-2001; 

PR 20-AUG-2001; 

PR 27-NOV-2001; 
XX 

PA (CORI-) CORIXA CORP. 
XX 

PI Benson DR, Kalos MD, Lodes MJ, Persing DH, Hepler WT, Jiang Y; 
XX 

DR WPI; 2002-627435/67. 

DR N-PSDB; ABV94680. 
XX 

PT New isolated polynucleotide and pancreatic tumor polypeptides, useful for 

PT diagnosing, preventing and/or treating cancer, particularly pancreatic 

PT cancer. 
XX 

PS Claim 2; SEQ ID NO 71; 300pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated polynucleotide (I) comprising: (a) 

CC any of a group of over 4000 nucleotide sequences (ABV94 628-ABV99145 ) ; (b) 

CC complements of (a); (c) sequences consisting of at least 20 contiguous 

CC residues of (a) ; (d) sequences that hybridize to (a) , under moderately 

CC stringent conditions; (e) sequences having at least 75% or 90% identity 

CC to (a); or (f) degenerate variants of (a). Polypeptides (ABP68596- 

CC ABP68637) encoded by (I) and oligonucleotide can be used to detect cancer 

CC in a patient and compositions comprising polypeptides, polynucleotides, 

CC antibodies, fusion proteins, T cell populations and antigen presenting 

CC cells expressing the polypeptide are useful in treating pancreatic cancer 

CC and stimulating an immune response. The polynucleotides can be used as 

CC probes or primers for nucleic acid hybridisation, in the design and 

CC preparation of ribozyme molecules for inhibiting expression of the tumour 

CC polypeptides and proteins in the tumour cells, in vaccines and for gene 

CC therapy. Note: The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 5; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.6e-292; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I II I I I I I I I I II I I 1 I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



2001US-0265305P. 
2001US-0265682P. 
2001US-0267568P. 
2001US-0278651P. 
2001US-0287112P. 
2001US-0291631P. 
2001US-0305484P. 
2001US-0313999P. 
2001US-0333626P. 



Qy 61 AGL S AAP VP T APAAGAPLMD FGND FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I I I I I i I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I II II I I I I I I I I I I I I I I I I I 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

II I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I M I I I I II II I I I II I I I I I I I I I I I 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA- WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

Ml: I : : I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDM1AAGGKIESNLESKV 4 06 

I I I I I 1 I I I I I I I I I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVT^QDSETDYVTTDNLTKVTE 526 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 

Db 481 PLLGDPTSENKTDEKKIEEKKAQIWEKNTSTKTSNPFLVT^QDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I I I I I I I II I I I I I I I I 1 I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I II I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I M I II I I I II I II I I 

Db 661 EAMSVSLKKVSGIKEEIKEPENIN7 1 JVLQETEAPYISIACDLIKETKLS7VEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 MAKVEQPVPDHSELVEDS S PDSEPVDLFSDDS I PDVPQKQDETVMLVKES LTETS FESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I II I I I I I I II I I I I I I I I I I I I I M I II I I I I II I M I I II M I II M I I I I I I I I I I I 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 



Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I 1 I I I I I I I I I I I I I I I I I 
Db 1020 VFGASLFLLLS LTVFS I VSVTAYI ALALLSVTI S FRI YKGVIQAI QKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1125 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVliMWFTYVGALFNGLT 1139 



Qy 


1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 




1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 


Db 


1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 


RESULT 4 


ABR59667 


ID 


ABR59667 standard; protein; 1192 AA. 


XX 




AC 


ABR59667 ; 


XX 




DT 


22-JUL-2003 (first entry) 


XX 




DE 


Human NogoA protein. 


XX 




KW 


Human; Nogo receptor; NgR; CTS domain; neuroprotective; gene therapy; 


KW 


axonal growth; central nervous system; CNS; Nogo; spinal cord injury; 


KW 


cranial trauma; cerebral trauma; spinal trauma; stroke; Krabbe's disease; 


KW 


demyelinating disease; multiple sclerosis; monophasic demyelination; 


KW 


encephalomyelitis; multifocal leukoencephalopathy ; panencephalitis. 


XX 




OS 


Homo sapiens. 


XX 




PN 


WO2003031462-A2. 


XX 




PD 


17-APR-2003. 


XX 




PF 


04-OCT-2002; 2002WO-US032007 . 


XX 




PR 


06-OCT-2001; 2001US-00972599 . 


XX 




PA 


(UYYA ) UNIV YALE. 


XX 




PI 


Strittmatter SM; 


XX 




DR 


WPI; 2003-393433/37. 


DR 


N-PSDB; ACC81048. 


XX 




PT 


New human Nogo receptor polypeptides and nucleic acids , useful for 


PT 


decreasing inhibition of axonal growth by a central nervous system 


PT 


neuron, or in treating central nervous system disease, disorder or 


PT 


injury, e.g. spinal cord injury. 



XX 

PS Disclosure; Page 131-135; 148pp; English. 
XX 

CC The invention relates to a novel nucleic acid encoding a polypeptide 

CC comprising amino acid residues 27-309 of a 473 amino acid sequence (PI, 

CC human Nogo receptor (NgR) NTLRRCT domain) , or residues 27-309 of PI with 

CC 1-20 conservative amino acid substitutions, and less than a complete CTS 

CC domain, provided that a partial CTS domain, if present, consists of no 

CC more than the first 39 consecutive residues. The nucleic acid of the 

CC invention has neuroprotective activity. The polynucleotide may have a use 

CC in gene therapy. The nucleic acid is useful for decreasing inhibition of 

CC axonal growth by a central nervous system (CNS) neuron. The NgR 

CC polypeptide or an agent inhibits the binding of Nogo to NgR or NgR- 

CC dependent signal transduction in the central nervous system neuron may be 

CC used in treating central nervous system disease, disorder or injury, e.g. 

CC spinal cord injury. Expression of an NgR protein may be associated with 

CC inhibition of axonal regeneration following cranial, cerebral or spinal 

CC trauma, stroke or a demyelinating disease, such as multiple sclerosis, 

CC monophasic demyelination, encephalomyelitis, multifocal 

CC leukoencephalopathy, panencephalitis, or Krabbe 1 s disease. The present 

CC sequence is used in the exemplification of the invention 

XX 

SQ Sequence 1192 AA; 

Query Match 98.2%; Score 5815; DB 6; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.6e-292; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I M I I I M I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPTAP AAGAP LMD FGND FVP PAP RGP L P AAP P VAP E RQ P S WD P S P VS ST VP AP 120 

I I I I M I I I I I I I II I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I M I I 
Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I II I I I I I I I I I I I I II II II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

III: I : : I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 28 6 

I I I I I I II I I II I I I I I I I I II I I I I I I M II I I I I I I I I I II I II I I I I I I I I I I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

I I II II I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEY7VDFKPFERWEWDSKEDSDMLAAGGKIESNLESKV 406 

I II I I I I I I I I I I I I I I I I I I I I I II I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 



Qy 



407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 



Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

Ml I I I M I II I II II I I I I I I I I I I t I I I I I I I I I I I I I I II I I I I II I 1 I I I I I I 
Db 4 81 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II II I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I II I I I II I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I II I I I I I I I I I I I I I 1 I I I I I I I I I I ! I II I I I II I I I I I I I I I I I I I I I II 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 8 86 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

II I I I I I I I I II I I I I I I I I I I I li I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I 

Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I I I I I I I I I I I I I I 

Db 960 TQAEI ES I VKPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTS WDLLYWRDI KKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1125 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1080 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLI LALI SLFSVPVI YERHQAQIDHYLGL7VNKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 

RESULT 5 
AAY56967 

ID AAY56967 standard; protein; 1192 AA. 
XX 

AC AAY56967; 



XX 

DT 25-APR-2000 (first entry) 
XX 

DE Human MAGI polypeptide. 
XX 

KW MAGI protein; neuroendocrine-specif ic protein; neuropathy; human; 

KW spinal injury; neuronal degeneration; neuromuscular disorder; cancer; 

KW psychiatric disorder; developmental disorder; inflammatory disorder; 

KW stroke; cytostatic; cerebroprotective; neuroprotective. 

XX 

OS Homo sapiens. 
XX 

PN WO200005364-A1 . 
XX 

PD 03-FEB-2000. 
XX 

PF 21-JUL-1999; 99WO-GB002360 . 
XX 

PR 22-JUL-1998; 98GB-00016024 . 

PR 19-JUL-1999; . 99GB-000168 98 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha RK; 
XX 

DR WPI; 2000-182693/16. 

DR N-PSDB; AAZ56886. 
XX 

PT Novel polypeptides related to neuroendocrine-specif ic proteins and 

PT polynucleotides useful for diagnosis of various diseases and for 

PT treatment of cancer and neurological disorders. 
XX 

PS Claim 2; Page 20-21; 35pp; English. 
XX 

CC The invention relates to human MAGI protein, which is similar to 

CC neuroendocrine-specif ic protein. The MAGI protein can be expressed by 

CC standard recombinant methodology. The MAGI polypeptides, polynucleotides 

CC and antibodies are useful for treating diseases, including neuropathies, 

CC spinal injury, neuronal degeneration, neuromuscular disorders, 

CC psychiatric disorders and developmental disorders, cancer, stroke and 

CC inflammatory disorders. The polynucleoitde is also useful for chromosome 

CC localization and for tissue expression studies. The present sequence 

CC represents the human MAGI protein 

XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 3; Length 1192; 
Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Db 




60 



Qy 



61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 



120 




Db 



61 AGL S AAP VPT APAAGAP LMD FGN D EVP P AP RG P L PAAP P VAP E RQ P S WD P S P VS ST VP AP 



120 



Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

I I I I I I I I I I II I I I I I 11 I I I I I I I I I I I II I I I I I I I I I M I ! I I I I II M I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

III: I : : I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 24 0 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 34 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 301 GSS FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDS KEDSDMLAAGGKI ESNLES KV 4 06 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 361 S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDS KEDSDMLAAGGKI ESNLES KV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

III I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I I II I II M I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 72 0 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I M I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 



Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I I I I I I I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I II I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ7VKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 6 
AAB82349 

ID AAB82349 standard; protein; 1192 AA. 
XX 

AC AAB8234 9; 
XX 

DT 23-JUL-2001 (first entry) 
XX 

DE Human NOGO-A protein. 
XX 

KW NOGO-A; human; chromosome 2p21; neuropathy; spinal injury; brain injury; 

KW stroke; neuronal degeneration; Alzheimer's disease; Parkinson's disease; 

KW neuromuscular disorder; psychiatric disorder; developmental disorder; 

KW neuroprotective; nootropic; neuroleptic; antiparkinsonian; 

KW cerebroprotective; neuroleptic; diagnosis; therapy. 
XX 

OS Homo sapiens. 
XX 

PN WO200136631-A1. 
XX 

PD 25-MAY-2001. 
XX 

PF 14-NOV-2000; 2000WO-GB004345 . 
XX 

PR 15-NOV-1999; 99GB-00026995 . 

PR 24-JAN-2000; 2000GB-00001550 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Michalovich D, Prinjha R; 
XX 

DR WPI; 2001-343822/36. 

DR N-PSDB; AAF90324. 
XX 

PT New polypeptide designated NOGO-C is a splice variant of the human NOGO 

PT gene and may be useful in the treatment of neural disorders including 

PT Alzheimer's and Parkinson's diseases. 
XX 

PS Disclosure; Page 26-27; 25pp; English. 
XX 



CC The present sequence is that of human NOGO-A. NOGO-A is a previously 

CC known splice variant of the human NOGO gene on chromosome 2p21. The 

CC invention relates to a novel splice variant, NOGO-C (see AAB82348). It 

CC provides NOGO-C polypeptides and polynucleotides , and methods for 

CC producing such polypeptides by recombinant techniques. Also disclosed are 

CC methods for utilising NOGO-C polypeptides and polynucleotides in the 

CC treatment of diseases including neuropathies, spinal injury, brain 

CC injury, stroke, neuronal degeneration, for example Alzheimer ! s disease 

CC and Parkinson's disease, neuromuscular disorders, psychiatric disorders 

CC and developmental disorders. Also provided are methods for identifying 

CC agonists and agonists for use in treating conditions associated with NOGO 

CC -C imbalance, and diagnostic assays for detecting diseases associated 

CC with inappropriate NOGO-C activity or levels 

XX 

SQ Sequence 1192 AA; 



Query Match 98.1%; Score 5810; DB 4; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

II I I I 1 I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VP T AP AAGAP LMD F GN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP AP 120 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 61 AGL S AAP VP TAP AAGAP LMD FGND FVP PAP RG P L PAAP P VAP ERQ P S WD P S P VS S T VP AP 12 0 



Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I II I I I I I I I I I I I I I I II I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 180 



Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

III: I : : I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 



Qy 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 346 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I 
Db 301 GSS FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 



Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

. II I I M I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I II II II I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 



Qy 4 07 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIF 4 66 

I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFP ST PEGIKDRPGAYITCAP FNPAATESIATNIF 480 



Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 



Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II I I I I I M 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I II I I I I I I I I I I I II II I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I M I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 825 

I I I I I I I M II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 
Db 781 EYENKEKLSALPPEGGKPYLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I II I I I I II I I I II I I I I I I I II I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I II I I I I I I I I II I II I I I I II I I I I I I I I I I : I I I M || I I I I I I I I M I | 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I I I I I I I I I I I I I I I II I II I II I I I I I II I I I I I I I II I II I I I I I I I I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALI SLFSVPVI YERHQAQI DHYLGIANKNVKDAMAKI QAKI PGLKRKAE 1178 

I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1140 LLIIALISLFSVPVIYERHQAQ I DHYLGIANKNVKD7\MAKI QAKI PGLKRKAE 1192 



RESULT 7 
ABG30938 

ID ABG30938 standard; protein; 1192 AA. 
XX 

AC ABG30938; 
XX 

DT 21-OCT-2002 (first entry) 
XX 

DE Human NogoA protein. 
XX 

KW Human; Nogo; BACE; acute neuronal injury; spinal injury; head injury; 

KW stroke; peripheral nerve damage; neoplastic disorder; glioblastoma; 

KW neuroblastoma; hyperprolif erative disorder; dysprolif erative disorder; 



KW cirrhosis; psoriasis; keloid formation; fibrocystic condition; cancer; 

KW tissue hypertrophy; central nervous system; axon regeneration; NogoA; 

KW Nogo-associated disease; metastasis. 
XX 

OS Homo sapiens. 
XX 

PN WO200257483-A2. 
XX 

PD 25-JUL-2002. 
XX 

PF 18-JAN-2002; 2002WO-GB000228 . 
XX 

PR 18-JAN-2001; 2001GB-00001312 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 

PA (SMIK ) SMITHKLINE BEECHAM PLC. 

XX 

PI Blackstock WP, Hale RS, Prinjha R, Rowley A; 
XX 

DR WPI; 2002-599722/64. 

DR N-PSDB; ABK90134. 
XX 

PT Identifying modulators of Nogo or BACE activity for treating acute 

PT neuronal injuries, neoplastic or dysprolif erative disorders, comprises 

PT providing and monitoring interaction between Nogo and BACE polypeptides. 

XX 

PS Disclosure; Page 59-62; 68pp; English. 
XX 

CC The present invention relates to a new method of identifying modulators 

CC of Nogo function or BACE activity. The method involves providing Nogo and 

CC BACE polypeptides capable of binding with each other, monitoring the 

CC interaction between these polypeptides, and determining if the test agent 

CC is a modulator of Nogo or BACE activity. The method is useful in treating 

CC acute neuronal injuries, such as spinal or head injury, stroke, 

CC peripheral nerve damage, and in neoplastic (e.g. glioblastomas, 

CC neuroblastomas), hyperprolif erative or dysprolif erative disorders (e.g. 

CC cirrhosis, psoriasis, keloid formation, fibrocystic conditions, tissue 

CC hypertrophy) of the central nervous system. The BACE polypeptide is 

CC useful in screening methods to identify agents that may act as modulators 

CC of BACE activity and in particular agents that may be useful in treating 

CC Nogo-associated diseases. The modulators of Nogo or BACE polypeptides, 

CC and the polynucleotide encoding the BACE polypeptide are useful in 

CC manufacturing a medicament for the treatment or prevention of disorders 

CC responsive to the modulation of Nogo activity, in alleviating the 

CC symptoms or improving the condition of a patient suffering from this 

CC disorder, in axon regeneration, or in preventing metastasis or spreading 

CC of a cancer. The polynucleotide may also be an essential component in 

CC assays, a probe, in recombinant protein synthesis, and in gene therapy 

CC techniques . The present amino acid sequence represents the human NogoA 

CC protein of the invention 

XX 

SQ Sequence 1192 AA; 



Query Match 98.1%; Score 5810; DB 5; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | || I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPT APAAGAP LMD FGN D EVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VPAP 120 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I | M I I I I I I I 
Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

Qy 121 S PLS AAAVS P S KLP EDDE P PARP P P P P PAS VS PQAE PVWT P PAP AP AAP P ST PAAPKRRG 180 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | | | | | | | | | | | | 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

Ml: I : : I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I 1 I I I II I II I I I I I I I I I I I I I I I II II I I II M | | | | || | || | 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I I I I I I I I II II I II I I I I I I I I I I I I I I I I I I I I II I II I I I I I I II II I I I I I I I I 
Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I II I II I II I I I I I I I II I II I I I I I I I I I I I I I I I I I I II I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

IN I M I I I II I II I I I II M I M I I I I I I I I I I I I I M II I I I I II II M I I I I I I 
Db 4 81 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I II I I I I I I I I I I I I I I II I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I I I I I I I I I I I I I I II I I I I I I I II I I I I 11 I I I I I I I II I I I I I I I I I I I I I I I I II I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I II I II II I I I I I I I II I I I I I I II I I II I II I I I I I I I I I I I I I I I I I I I I I M 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 72 0 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I II I I I I I I I I I I I I II II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 721 MAfCVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 7 66 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 



Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I I I I I I I M I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | M I I I I I I I I I I I | | 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

QY 946 TQAEIESIVKPKVXEKEAEKKLPSDTEKEDRSPSAIFS7U)LGKTSVVDLLYWRDIKKTGV 1005 

I M II I I I I I I I I I I I I I I I I I I I I I I M | I I ! | | | | | : | | | | | | | | M | | M I I I II 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I II I I I I I I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

I M I I I I I I II I I I I I I I I I M I I || I | | | | | | M I I I I I II I I I I I II I I I I I I | | M I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I M I I I I I I I I I I I I I I II I I II I I I I I II I I || I I I I I I I I | 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 

RESULT 8 
ABB81078 



ID ABB81078 standard; protein; 1192 AA. 
XX 

AC ABB81078; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Human neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy; 

KW neurotransmitter receptor; human; receptor. 
XX 

OS Homo sapiens. 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-008 9334 8 . 
XX 

PR 19-MAY-1998; 98IL-00124500 . 

PR 21-JUL-1998; 98WO-US014715 . 

PR 22-DEC-1998; 98US-00218277 . 

PR 19-MAY-1999; 99US-00314161 . 
XX 

PA (YEDA ) YEDA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 



PI Moalem G; 
XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86601. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from in j ury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides . 

XX 

PS Example; Page 53-56; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue or 

CC combinations. The method is useful for promoting nerve regeneration and 

CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson f s 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington f s chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt- Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates ) , Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the human 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 1192 AA; 

Query Match 98.1%; Score 5810; DB 5; Length 1192; 
Best Local Similarity 97.2%; Pred. No. 2.9e-292; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

M I I I I I M I I I I I I I I I I I I I I I I II I I I | | M I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VP T APAAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I M I I I I I I I I II I I M I I I t I I I I I I I I I I I I I I I I | | I I | I | | | | | | | | | | | || | | | | 

Db 61 AGL S AAP VP T APAAGAP LMD FGN D FVP PAP RGP L P AAP P VAP E RQ P S WD P S P VS S T VPAP 120 



Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

ll>= I : : II I I I I M I I I i I II I I I I I I I I I I I I I | | | | | 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

QY 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

MINIMI I I I I I I I || | | I || I I I | | | | | | | M I I I I I I 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 2 87 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 34 6 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I | | | | | 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I M I II I I I I I II I I I I I I I I I I I I I | | | | M I U II I I I I I I I I I I I I I I | I | | | | | | 
Db 361 S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDS KEDSDMLAAGGKI ESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

M I I II I II I I I I I I I I I I I I I II I | I I | | | | || I I I I I I I I I I I I I II 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKWE 526 

I I I MINI M I I M I I I I I I II II I I I I I I I I II I I I I II I I I I I I M I I I II I I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I M I I I I I I II I I I I I I I I I I I I II I I I | | | | || || || || | | | | | | | | | | | | | | | | | | | 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I M I I I I I I II I I II I I I I I I I I I I I I I | | | M | | | | | | | | | | | | | || | | | | | | | | || | 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I M I I I I I I I I II I I I || I I M I M I I I I I I I M I I I I I I I | | | | | | | | | | | | | | | | | 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I M I I I I I M I I I I I I I I I I II I I I I I II I I I II I I II I I I I I I I I I I I I 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I M I I I II I I I I I I I I I I 1 I || I | I | | | | | | | | | | | M | | | | | | | | | | | | | | | M | | | | 
Db 781 EYENKEKLSALPPEGGKPYLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 84 0 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I M I I II I I I I I I I I II I I I I I I I I | || | | | | | | | | || | | | | | || | || I I I I I I II I II 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKIAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I M I I I I I I I II I I I I II I II I I I || | | | | | | | || | | | | | | | | | | | | | | | | | | || | | | 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQT^EIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

M I I I I I I I II I II I I I I I I I I I I II I I | I I | | | | | | | : | I I I | I M II I I II I I II I 



Db 


960 


T OAE I E S I VKP KVL VKEAE KKL P S DT E KF, H P 


1 f|1 Q 
-L u X _? 


Qy 


1006 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 




Db 


1020 


VFGASLFLLLSLTVFSIVSVTAYIAT.ATJ.^VTT ^FRTYKnVTDATnvcinpnKPPRavT pc 


i n 7 q 


Qy 


1066 


EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 


1125 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1080 


EVAI S EELVQKYSN S ALGHVNCT I KELRRLFLVDDLVDS LKFAVLMWVFT YVGAL FNGLT 


1139 


Qy 


1126 


LLII^VLISLFSVPVIYERHQAQIDHYLGIJU^KNWDAMAKIQAKIPGLKRKAE 1178 








Ml 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 | | M | | | | | | | | | | | | | | | M | | | 




Db 


1140 


LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 




RESULT 9 








AAU33228 








ID AAU33228 


standard; protein; 1246 AA. 





XX 

AC AAU33228; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Novel human secreted protein #3719. 
XX 

KW Human; vaccination; gene therapy; nutritional supplement; 

KW stem cell proliferation; haematopoiesis ; nerve tissue regeneration; 

KW immune suppression; immune stimulation; anti-inflammatory; leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200179449-A2. 
XX 

PD 25-OCT-2001. 
XX 

PF 16-APR-2001; 2001WO-US008656 . 
XX 

PR 18-APR-2000; 2000US-00552929 . 

PR 26-JAN-2001; 2001US-00770160 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Drmanac RT; 
XX 

DR WPI; 2001-611725/70. 
XX . 

PT Nucleic acids encoding a range of human polypeptides, useful in genetic 

PT vaccination, testing and therapy. 

XX 

PS Claim 20; Page 737; 765pp; English. 
XX 

CC The invention relates to novel human secreted polypeptides. The 

CC polypeptides and antibodies to the polypeptides are useful for 

CC determining the presence of or predisposition to a disease associated 

CC with altered levels of polypeptide. The polypeptides are also useful for 

CC identifying agents (agonists and antagonists) that bind to them. Cells 

CC expressing the proteins are useful for identifying a therapeutic agent 



CC for use in treatment of a pathology related to aberrant expression or 

CC physiological interactions of the polypeptide. Vectors comprising the 

CC nucleic acids encoding the polypeptides and cells genetically engineered 

CC to express them are also useful for producing the proteins. The proteins 

CC are useful in genetic vaccination, testing and therapy, and can be used 

CC as nutritional supplements. They may be used to increase stem cell 

CC proliferation; to regulate haematopoiesis ; and in bone, cartilage, tendon 

CC and/or nerve tissue growth or regeneration; immune suppression and/or 

CC stimulation; as anti-inflammatory agents; and in treatment of leukaemias. 

CC AAU29510-AAU33304 represent the amino acid sequences of novel human 

CC secreted proteins of the invention 

XX 

SQ Sequence 124 6 AA; 

Query Match 93.3%; Score 5526.5; DB 4; Length 1246; 

Best Local Similarity 93.0%; Pred. No. 1.5e-277; 

Matches 1123; Conservative 13; Mismatches 39; Indels 33; Gaps 10; 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 
MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 101 

AGL S AAP VP T APAAGAP LMD FGNDFVP PAP RG P L PAAP P VAP ERQPSWDPS P VS S T VP AP 12 0 
I I I I I I M I I I I I I I II I I I I I II I I i I I I I I I I II I I I I I I I I || I I | | | | | | | | | | | | 
AGL SAAPVPTAPAAGAP LMD FGNDFVP PAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 161 

S P LSAAAVS PSKLPEDDEP P ARP P P P P P AS VS P QAE P VWT P PAPAPAAP P S T PAAP KRRG 180 
I I M I M I I I I I I I I I I I I I I I I I I I M I | I I I I | | | | | | | | | || | | | | | | | | | | | | | | | 
SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 221 

SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

I : : M I I I I I I I I I I I I I I I I I I M I I I I M I II I I 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 281 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 2 86 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I M I I I 
LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 341 

GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 34 6 
I M I I I I II II II II I I I I II I I II I I I I I | | | M | | | | | | | || | | | | | | | | M | | | | | 
GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 4 01 

SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 
I I I I I I I I I I I I I I I I I I I I I I II I I I I II I II I I I I II I I I I II I I I I II I I I I I I I I I 
SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 4 61 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 4 66 
I I M I M I I I I I I I I I I I I I I I I I II I I I II I I I I I I I 1 I I I I I I I I II I I I I I | | | I I I 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 521 

PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 
Ml I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 581 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

I I I I I I I I I I I I I I I M I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 641 



Qy 


1 


Db 


42 


Qy 


61 


Db 


102 


Qy 


121 


Db 


162 


Qy 


181 


Db 


222 


Qy 


227 


Db 


282 


Qy 


287 


Db 


342 


Qy 


347 


Db 


402 


Qy 


407 


Db 


462 


Qy 


467 


Db 


522 


Qy 


527 


Db 


582 



Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I M I I I I I I I I I I I I I I I I I I | I | | | | | | | | | M I I I I I I I I I I I I I I I II I II I I I I I 
Db 642 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 701 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I II I | | | | | || | | | | || || | | || | | || | | | | | | | M 

Db 702 EAMSVSLKKVSGIKEEI KEPENINAALQETEAPYI S IACDLI KETKLSAEPAPDFSDYSE 761 

QY 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I II I I I I II I I I 
Db 762 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 821 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I M I It I I I II I II I I I I II I I I II I II I I I I I || | | | || | | || | | | | | | | | | | | M 
Db 822 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 881 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | | M I 
Db 882 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 941 

Qy 8 86 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I M I II II I I I I I I M I II I I I I I I I I I I I I I I || I | | || | | | | | | M | | | | M I I I I 
Db 942 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAIiA- 1000 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I M I I I I I I I I I I I I I I II I I I I I I I I I I I | | | | | | | : | | || || | | | | | | | | | | | | | 
Db 1001 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1060 

Qy 1006 VFGAS-LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY — 1062 

Mill : I I I I I I I II I II I I | | || | | | | | | | | | M | | | | | | | | | | | | | | || | | | | | 
Db 1061 VFGASAVFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAISG 1120 

Qy 1063 -LESEVAISEELVQKYSNSALGHV-NCTIKELRR LFLVDDLVDSLK-FAVXMWFTY 1116 

M I : : I : I I I I I I I : I I I : I I I I I II II : I M I I I I I I I 

Db 1121 NLESCLYLRELGSGRYSNSALGSMWNCTVKGNFRAPSFFSWMDLVDSLRSFAVLMWVFTY 1180 

Qy 1117 VGALFNGLTLL ILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKI 117 0 

I I I 1:1 | || : | | | | | | | | | | | || | | | | || | | | | | | | | | | || 

Db 1181 VGCL- -GLMVLDTTGFWALNFI S S SGSWLI YERHQAQI DHYLGLANKNVKDAMAKIQAKI 1238 

Qy 1171 PGLKRKAE 1178 

I I II II I I 
Db 1239 PGLKRKAE 1246 

RESULT 10 
ABU11573 

ID ABU11573 standard; protein; 983 AA. 
XX 

AC ABU11573; 
XX 

DT 12-FEB-2003 (first entry) 
XX 

DE Human MDDT polypeptide SEQ ID 520. 
XX 

KW MDDT; human; disease detection and treatment molecule polypeptide; 



KW anti-inflammatory; immunosuppressive; osteopathic; cytostatic; anti-HIV; 

KW haemostatic; nephrotropic; antianaemic; antipsoriatic; hepatotropic; 

KW gene therapy; protein replacement therapy; cell proliferative disorder; 

KW cancer; adenocarcinoma; leukaemia; lymphoma; melanoma; myeloma; sarcoma; 

KW anaemia; Crohn's disease; acquired immunodeficiency syndrome; AIDS; 

KW Goodpasture's syndrome; inflammation; osteoporosis; thrombocytopaenia; 

KW psoriasis; hepatitis. 
XX 

OS Homo sapiens. 
XX 

PN WO200279449-A2. 
XX 

PD 10-OCT-2002. 
XX 

PF 27-MAR-2002; 2002WO-US009944 . 
XX 

PR 28-MAR-2001; 2 001US-02 7 9619P . 

PR 29-MAR-2001; 2 001US-02 8 0067P . 

PR 29-MAR-2001; 2001US-02 80068P . 

PR 16-MAY-2001; 2001US-02 912 8 OP . 

PR 17-MAY-2001; 2001US-0291829P . 

PR 17-MAY-2001; 2 001US-02 918 4 9P . 

PR, 19-JUN-2001; 2001US-02 9942 8P . 

PR 20-JUN-2001; 2001US-0299776P . 

PR 20-JUN-2001; 2 001US-03 000 01P . 
XX 

PA (INCY-) INCYTE GENOMICS INC. 
XX 

PI Daffo A, Jones AL, Tran AB, Dahl CR, Gietzen D, Chinn J; 

PI Dufour GE, Hillman JL, Yu JY, Tuason O, Yap PE, Amshey SR; 

PI Daugherty SC, Dam TC, Liu TF f Nguyen DA, Kleefeld Y, Gerstin EH; 

PI Peralta CH, David MH, Lewis SA, Chen A J, Panzer SR, Harris B; 

PI Flores V, Marwaha R, Lo A, Lan RY, Urashka ME; 

XX 

DR WPI; 2003-058431/05. 

DR N-PSDB; ABX34563. 
XX 

PT New purified disease detection and treatment molecule proteins and 

PT polynucleotides, useful for diagnosing, treating or preventing cancers 

PT (e.g. leukemia or sarcoma), anemia, Crohn's disease, AIDS, osteoporosis 

PT or hepatitis . 

XX 

PS Claim 27; SEQ ID NO 520; 339pp + Sequence Listing; English. 
XX 

CC This invention describes a novel disease detection and treatment molecule 

CC polypeptide (MDDT) which has anti-inflammatory, immunosuppressive, 

CC osteopathic, cytostatic, anti-HIV, haemostatic, nephrotropic, 

CC antianaemic, antipsoriatic and hepatotropic activity. The polynucleotides 

CC and the polypeptides of the invention can be used for gene therapy, 

CC protein replacement therapy and are useful for treating a variety of 

CC diseases or conditions. These polypeptides or polynucleotides are 

CC particularly useful for diagnosing, treating or preventing cell 

CC proliferative disorders (e.g. cancers including adenocarcinoma, 

CC leukaemia, lymphoma, melanoma, myeloma or sarcoma), anaemia, Crohn's 

CC disease, acquired immunodeficiency syndrome (AIDS), Goodpasture's 

CC syndromes, inflammation, osteoporosis, thrombocytopaenia, psoriasis or 

CC hepatitis. ABU114 50-ABU118 45 represent the MDDT polynucleotides encoded 



CC by ABU11450-ABU11845, described in the disclosure of the invention. NOTE: 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format from WIPO at 

CC ftp . wipo . int/pub/published_pct_sequences 

XX 

SQ Sequence 983 AA; 

Query Match 77.0%; Score 4560; DB 6; Length 983; 

Best Local Similarity 97.1%; Pred. No. 1.2e-227; 

Matches 919; Conservative 8; Mismatches 11; Indels 8; Gaps 3 



Qy 


240 


NLSTVL PTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVS 


293 






1 : : 1 : : 1 : = = 1 1 1 II i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | | | | | | 




Db 


39 


NMNTLVICQQYYPLKEHFKKNVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVS 


98 


Qy 


294 


PKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKD 


353 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 II 1 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I M 1 1 




Db 


99 


PKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEWSSEKAKD 


158 


Qy 


354 


SFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFAD 


413 






1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 I I I I I I | I I I 




Db 


159 


SFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFAD 


218 


Qy 


414 


SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPT 


473 






1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II I I Ml 




Db 


219 


SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPT 


278 


Qy 


474 


SENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYWTDNLTKVTEEWANMP 


533 






Ml 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 || I I I I I I I I I || M | | | | | | | | | | | | | | | | | 




Db 


279 


SENKTDEKKIEEKKAQIWEKNTSTKTSNPFLVAAQDSETDYWTDNLTKVTEEWANMP 


338 


Qy 


534 


EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATP 


593 






1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I I I I I | | | | | | | || | | | | | | | | | | | | | 




Db 


339 


EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATP 


398 


Qy 


594 


SPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL 


653 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 II 




Db 


399 


SPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSL 


458 


Qy 


654 


-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 


712 






1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


459 


KKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVUQP 


518 


Qy 


713 


VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 


772 






1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


519 


VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 


578 


Qy 


773 


LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 


832 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I II 1 1 1 1 I I 




Db 


579 


LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 


638 


Qy 


833 


SKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 


892 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I I I 1 II M 1 1 




Db 


639 


SKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 


698 


Qy 


893 


GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVXLLPPDVSALGHTQAEIES 


952 






1 1 1 1 1 1 1 M 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | | 1 | M 1 





Db 699 GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA-TQAEIES 757 

QY 953 IVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLF 1012 

I I i I I I I I I I I I I I : I I I I I I I | | | | I I 

Db 758 IVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLF 817 

Qy 1013 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 1072 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 818 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 877 

QY 1073 LVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLI 1132 

I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 878 LVQKYSNSALGHWCTIKELRRLFLVDDW 937 

Qy 1133 SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I M I I II I I M I I I I II I I II I I II I I I I I I I I I II I I I I I I | | | 
Db 938 SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 983 



RESULT 11 




AAY95012 




ID 


AAY95012 standard; protein; 893 AA. 


XX 






AC 


AAY95012; 




XX 






DT 


19-JUN-2000 ( 


first entry) 


XX 






DE 


Human secreted 


protein vb22_l, SEQ ID NO: 64. 


XX 






KW 


Human; secreted protein; cancer; tumour; cardiovascular disorder; 


KW 


blood disorder 


; haemophilia; autoimmune disease; diabetes; inflammation; 


KW 


infection; fungal; bacterial; viral; HIV; allergy; arthritis; 


KW 


neurodegenerative disease; asthma; contraceptive. 


XX 






OS 


Homo sapiens. 




XX 






PN 


WO200011015-A1 




XX 






PD 


02-MAR-2000. 




XX 






PF 


24-AUG-1999; 


99WO-US019351. 


XX 






PR 


24-AUG-1998; 


98US-0097638P. 


PR 


24-AUG-1998; 


98US-0097659P. 


PR 


09-SEP-1998; 


98US-0099618P. 


PR 


28-SEP-1998; 


98US-0102092P. 


PR 


25-NOV-1998; 


98US-0109978P. 


PR 


23-DEC-1998; 


98US-0113645P. 


PR 


23-DEC-1998; 


98US-0113646P. 


PR 


23-AUG-1999; 


99US-00379246. 


XX 






PA 


(ALPH-) ALPHAGENE INC. 


XX 






PI 


Valenzuela D, 


Yuan O, Hoffman H, Hall J, Rapiejko P; 


XX 




DR 


WPI; 2000-224657/19. 


XX 







PT New* secreted or transmembrane proteins and polynucleotides encoding them, 

PT useful for treating neurodegenerative disorders , autoimmune diseases and 

PT cancer. 
XX 

PS Claim 73; Page 322-325; 357pp; English. 
XX 

CC The invention relates to 40 human secreted proteins (AAY94 981-Y9502 0 ) , 

CC and cDNA sequences encoding them (AAA23423-A23462 ) . The secreted proteins 

CC of the invention include those that are thought to be only partially 

CC secreted, i.e., transmembrane proteins. The proteins of the invention may 

CC exhibit one or more activities selected from the following: cytokine 

CC activity; cell proliferation; differentiation; immune modulation; 

CC haematopoiesis regulation; tissue growth activity; activin/inhibin 

CC activity; chemotactic/chemokinetic activity; haemostatic and thrombolytic 

CC activity; anti-inflammatory activity; and tumour inhibition activity. The 

CC proteins may be administered to patients as vaccines, and the nucleotides 

CC may be used as part of a gene therapy regime. Diseases or conditions that 

CC may be treated using the proteins or nucleotides of the invention include 

CC autoimmune diseases; genetic disorders; haemophilia; cardiovascular 

CC diseases; cancer; bacterial, fungal and viral infections, especially HIV; 

CC multiple sclerosis; rheumatoid arthritis; pulmonary inflammation; 

CC Guillain-Barre syndrome; insulin dependent diabetes mellitus; and 

CC allergic reactions such as asthma and anaemia. They may also be used for 

CC treating wounds, burns, ulcers, osteoporosis, osteoarthritis, periodontal 

CC diseases, Alzheimer's disease, Parkinson's disease, Huntington's disease 

CC and amyotrophic lateral sclerosis (ALS) . Proteins with activin/inhibin 

CC activity may additionally be useful as contraceptives. Nucleic acid 

CC sequences of the invention may be used in chromosome mapping, and as a 

CC source of diagnostic primers and probes. The present sequence represents 

CC one of the 40 proteins of the invention 

XX 

SQ Sequence 893 AA; 

Query Match 74.3%; Score 4400; DB 3; Length 893; 

Best Local Similarity 98.9%; Pred. No. 2e-219; 

1; Conservative 1; Mismatches 7; Indels 2; Gaps 2; 

MGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEV 345 
M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I i I I I I I I II I M I I I I I I I I 
MGS SFSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEV 60 

VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESK 4 05 
I I M M I M I I I I I I I I I I I I I I I I II I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | 



I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I II I I I II I II I I I I I I I I I I 



I I M Mill I I I I | I | I I I I I I I | I I I II I I I I I I I I I I I I I I || I I I | | | | | I I I 



I I I I I I I I I I I I I I I I II I I M I I I I I M II I I I I I I I I I I I I I I I I I I I I I I 



Matches 


88. 


Qy 


286 


Db 


1 


Qy 


346 


Db 


61 


Qy 


406 


Db 


121 


Qy 


466 


Db 


181 


Qy 


526 


Db 


241 



Qy 586 FEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPY 645 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II | I I 
Db 301 FEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPY 360 

Qy 646 EEAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYS 704 

M I I I I I I I I I I I I II I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I M I I II I I I I 
Db 361 EEAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYS 420 

Qy 7 05 EMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESM 764 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I | I I 
Db 421 EMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESM 480 

Qy 7 65 IEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAV 824 

I I M I M I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I II I I I I I I I I II I I I I I I | || 
Db 481 IEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAV 540 

Qy 825 YSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSE 884 

I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I II I I I I I I I I I I I I I 
Db 541 YSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSE 600 

Qy 885 IANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALG 944 

I I I I I I I I I II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 601 IANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA 660 

Qy 945 HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTG 1004 

I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I I I I I II I I | | M 
Db 661 -TQAEI ESI WPKVLVKEAEKKLPSDTEKEDRS PSAI FSAELSKTSWDLLYWRDIKKTG 719 

Qy 1005 WFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLE 1064 

I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I II I I I I I I I || I || 
Db 720 WFGAS LFLLLS LTVFS I VSVTAYIALALLSVT I S FRI YKGVI QAIQKS DEGHPFRAYLE 779 

Qy 1065 SEVAISEELVQKYSNST^GHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGL 1124 

I I I I I I I M II II I I I II I I I I I.I I I II II I I I I | || || | M | M | | | | | | | | | | M | M 
Db 780 SEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGL 839 

Qy 1125 TLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 117 8 

I I M I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I | | | | | | | | || | 
Db 840 TLLIIALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 8 93 



RESULT 12 
AAY71310 

ID AAY71310 standard; protein; 1163 AA. 
XX 

AC AAY71310; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperproliferative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 



KW 


structural plasticity; screening. 




XX 








OS 


Rattus sp . 






XX 








FH 


Key 


Location/Qualifiers 




FT 


Inhibitory- site 


1. .171 




FT 




/note= "Inhibits NIH 3T3 fibroblast spreading" 


FT 


Modif ied-site 


30 




FT 




/note= "Casein kinase II site" 




FT 


Region 


31. .58 




FT 




/note= "Acidic region" 




FT 


Region 


31. .57 




FT 




/note= "Region specifically described 


in claim 16" 


FT 


Region 


172. .259 




FT 




/note= "This region is not essential 


for inhibitory 


FT 




activity" 


FT 


Modif ied-site 


233 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


242. .244 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


291 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


295 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Misc-dif f erence 


404 




FT 




/note= "Encoded by TTG" 




FT 


Modif ied-site 


436 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


468. .470 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


484 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


488 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


502 




FT 




/note= "Casein kinase II site" 




FT 


Inhibitory- site 


542. .722 




FT 


Modif ied-site 


576 




FT 




/note= "Casein kinase II site" 




FT 


Peptide 


623. .640 




FT 




/note= "used as immunogen to generate 


antibody AS 4 72" 


FT 


Modif ied-site 


626 


FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


694. .696 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


715 




FT 




/note= "Casein kinase II site" 




FT 


Peptide 


762. .1163 




FT 




/note= "used as immunogen to generate 


antibody AS Bruna 


FT 


Modif ied-site 


784 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


821 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


850 




FT 




/note= "Protein kinase C (PKC) site" 




FT 


Modif ied-site 


855 




FT 




/note= "Protein kinase C (PKC) site" 





FT 


Modified- site 


863 




FT 




/note= "Casein kinase II site" 




FT 


Modified- site 


868 




FT 




/note= "Protein kinase C (PKC) 


site" 


FT 


Modif ied-site 


893 




FT 




/note= "Protein kinase C (PKC) 


site" 


FT 


Modif ied-site 


912. .914 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


925. .927 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


954 




FT 




/note= "PKC and casein kinase 


II sites" 


FT 


Modif ied-site 


956 




FT 




/note= "PKC and casein kinase 


II sites" 


FT 


Region 


975. .1162 




FT 




/note= "This region is not essential for inhibitory 


FT 




activity" 




FT 


Region 


976. .1163 




FT 




/note= "C-terminal common region found in Nogo A, B 


FT 




isof orms " 




FT 


Domain 


988. .1023 




FT 




/label= Transmembrane domain 




FT 




/note= "C-terminal hydrophobic 


region specifically 


FT 




described in claim 16" 


FT 


Modif ied-site 


1024 




FT 




/note= "Protein kinase C (PKC) 


site" 


FT 


Modif ied-site 


1071. .1073 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


1073 




FT 




/note= "Protein kinase C (PKC) 


site" 


FT 


Modif ied-site 


1089 




FT 




/note= "Protein kinase C (PKC) 


site" 


FT 


Domain 


1090. .1125 




FT 




/label= Transmembrane domain 




FT 




/note= "C-terminal hydrophobic 


region specifically 


FT 




described in claim 16" 




FT 


Modif ied-site 


1141. .1143 




FT 




/note= "Asn is N-glycosylated" 




FT 


Modif ied-site 


1143 




FT 




/note= "Protein kinase C (PKC) 


site" 


XX 








PN 


WO200031235-A2 , 






XX 








PD 


02-JUN-2000. 






XX 








PF 


05-NOV-1999; 


99WO-US026160. 




XX 








PR 


06-NOV-1998; 


98US-0107446P. 




XX 








PA 


(SCHW/) SCHWAB 


M E. 




PA 


(CHEN/) CHEN M 


S. 




XX 








PI 


Schwab ME, Chen MS; 




XX 








DR 


WPI; 2000-400052/34. 




DR 


N-PSDB; AAD01173. 




XX 









PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Claim 3; Fig 2A; 122pp; English. 
XX 

CC The present sequence is a rat Nogo A protein which is a potent neural 

CC cell growth inhibitor and is free of all central nervous system (CNS) 

CC myelin material with which it is natively associated. The protein was 

CC derived from a cDNA generated by fusing R018U37-3, R1-3U21 cDNAs isolated 

CC from hexanucleotides-primed rat brain stem/spinal cord library, and 01il8 

CC cDNA from an oligo d(T) -primed rat oligodendrocyte library. Nogo proteins 

CC and fragments displaying neurite growth inhibitory activity are used in 

CC the treatment of neoplastic disease of the CNS e.g. glioma, glioblastoma, 

CC medulloblastoma, craniopharyngioma, ependyoma, pinealoma, 

CC haemangioblastoma, acoustic neuroma, oligodendroglioma, menagioma, 

CC neuroblastoma or retinoblastoma and degenerative nerve diseases e.g. 

CC Alzheimer's and Parkinson's diseases. Therapeutics which promote Nogo 

CC activity can be used to treat or prevent hyperprolif erative or benign 

CC dysproliferative disorders e.g. psoriasis and tissue hypertrophy. 

CC Ribozymes or antisense Nogo nucleic acids can be used to inhibit 

CC production of Nogo protein to induce regeneration of neurons or to 

CC promote structural plasticity of the CNS in disorders where neurite 

CC growth, regeneration or maintenance are deficient or desired. The animal 

CC models can be used in diagnostic and screening methods for predisposition 

CC to disorders and to screen for or test molecules which can treat or 

CC prevent disorders or diseases of the CNS. Note: The present sequence 

CC designated as SEQ ID NO: 2 is stated to be the same as the sequence shown 

CC in Fig. 13 (see AAY71384) of the specification. However, this sequence 

CC does not match the sequence given in Fig. 13. SEQ ID numbers 35-42 are 

CC referred in claim 32 and SEQ ID NO: 29 in disclosure of the 

CC specification. However, the specification does not include sequences for 

CC these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 



Query Match 72.5%; Score 4296.5; DB 3; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 6.2e-214; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 



Qy 


l 


MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 

1 1 i : 1 1 1 Mill 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 1 1 II : 1 1 :: 1 1 1 1 1 1 1 I I I 1 I I 

MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 


58 


Db 


l 


60 


Qy 


59 


P AAGL S AAP VPT APAAGAP LMD FGND FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP 
1 1 1 1 1 1 1 1 M 1 II 1 1 1 : I I : I | | | | | | | | | | | | M 1 1 1 1 1 1 1 1 : II : 1 
PAAGL S AAAVP - - PAAAAP LLDFSSDS VP PAP RG P L P AAP PAAP E RQ P S W E RS P AA P 


118 


Db 


61 


115 


Qy 


119 


APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 
IN 1 M 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | 1 1 1 1 1 1 II 1 1 1 1 1 
APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAP P S T PAAP KR 


178 


Db 


116 


166 


Qy 


179 


R GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 
HI : 1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 
RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 


224 


Db 


167 


226 


Qy 


225 


SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 


284 



Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGS S FSVS PKAESAVT VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDE 344 

I I I I I I I I I I I I : : I I : M : | | : : | | : | : I I : I | ■ I | | | 

Db 287 EMGS S FKGS PKGESAI LVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG -KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS-DMLAAGGKIESNLE 403 

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I ! : | : | 

Db 341 WS P E KTMD I FN EMQM S WAP VRE E YAD F K P FEQ AW E VKDT YE G S RDVLAA RANVE 396 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 4 63 

I I M : I I I I I I I : I I I I I : I I I I I I I I : I I I I I I I I I I : | |l I 
Db 397 SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I I I I I I I I I I I I I I I I I I I : I II I : I I I II I II I I I II I I I I MINI I : I 
Db 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

I M I : I I I I I I I I I I I I I I I I I I I || || I I | | || | | : | | || | | | : | | | | | | I ! I I 
Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

I M II : I I I I I I I I I I I I I I II I I I : I II I I I I : I I I I I I I I I : I : I I I Mill 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

I I I I I M : I : II I II Mill: I II : I I I I I I I I M I I I I I I I I I I I I I : I I I I : 
Db 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

I I I : M M I I : I : I I I I II I I : I I I I I I I M I II I : I II Ml I I I : I II I I I I I 
Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : I : I I I I I I II I I I I I : : I MM :: : I M I II I I I II I I : I 

Db 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

I : I I I I I I II I M M M I I I I II I II II II I I I MM II I I I M II I I I I I I 

Db 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVS7UCDDS-PKLAKEYTDLEVSDK 870 

Qy 883 SEIT^APDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

I I I I I I I II I I I II I II II I II : : : I I M I M I : I I : I M II 
Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I M Mill I I II I II M I I I I I I I I I II: MM I I I I I I I I I I I I I I I 

Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

II I I M I I II I II II II I I I I II I I I II II II I I II II II II I II I I II I I I I I I II I I I 

Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

II I I M I I M I I I I I I I II I I I I I I II II I I I I II II I I | || M I I II II I II II I II I 
Db 1048 L E S EVAI SEE LVQ K Y S N S AL GHVN S T I K E L RRL FLVD D LVD S L K FAVLMWVFT YVGAL FN 1107 



Qy 1123 GLTLLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I I I I I I I I I I II : I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I : 
Db 1108 GLTLLILALI SLFS I PVI YERHQVQIDHYLGIANKSVKDAMAKIQAKI PGLKRKAD 1163 



RESULT 13 
ABB81074 

ID ABB81074 standard; protein; 1163 AA. 
XX 

AC ABB81074; 
XX 

DT 05-NOV-2002 (first entry) 
XX 

DE Rat neurotransmitter receptor protein Nogo-A. 
XX 

KW Nerve regeneration; neuroprotection; neuronal degeneration; CNS; PNS; 

KW central nervous system; peripheral nervous system; tranquillizer; Nogo; 

KW vulnerary; cerebroprotective; anti-tumour; antidiabetic; anticonvulsant; 

KW nootropic; antiparkinsonian; ophthalmological ; analgesic; hepatotropic; 

KW osteopathic; vasotropic; nephrotropic; cytostatic; antigen; gene therapy 

KW neurotransmitter receptor; rat; receptor. 
XX 

OS Rattus norvegicus. 
XX 

PN US2002072493-A1. 
XX 

PD 13-JUN-2002. 
XX 

PF 28-JUN-2001; 2001US-008 93348 . 
XX 

PR 19-MAY-1998; 98IL-00124 500 . 

PR 21-JUL-1998; 98WO-US014715 . 

PR 22-DEC-1998; 98US-002 18277 . 

PR 19-MAY-1999; 99US-00314161 . 
XX 

PA (YEDA ) YE DA RES & DEV CO LTD. 
XX 

PI Eisenbach-Schwartz M, Hauben E, Cohen IR, Beserman P, Mosonego A; 

PI Moalem G; 

XX 

DR WPI; 2002-607255/65. 

DR N-PSDB; ABN86600. 
XX 

PT Promoting nerve regeneration and preventing neuronal degeneration in the 

PT central/peripheral nervous system from injury/disease, comprises 

PT administering nervous system-specific activated T cells/antigen, or 

PT analogs/peptides . 

XX 

PS Example 5; Page 44-47; 93pp; English. 
XX 

CC The invention relates to promoting nerve regeneration or conferring 

CC neuroprotection and preventing or inhibiting neuronal degeneration in the 

CC central/peripheral nervous system (NS) . The method involves administering 

CC NS-specific activated T cells, NS-specific antigen, its analogue or its 

CC peptide, a nucleotide sequence the NS-specific antigen or its analogue oi 

CC combinations. The method is useful for promoting nerve regeneration and 



CC preventing neuronal degeneration in central/peripheral nervous system 

CC from injury/disease, where the injury is spinal cord injury, blunt 

CC trauma, penetrating trauma, hemorrhagic stroke, ischaemic stroke or 

CC damages caused by surgery such as tumour excision. The disease is not an 

CC autoimmune disease or neoplasm. The disease results in a degenerative 

CC process occurring in either gray or white matter or both. The disease is 

CC diabetic neuropathy, senile dementia, Alzheimer's disease, Parkinson's 

CC disease, facial nerve (Bell's) palsy, glaucoma, Huntington's chorea, 

CC amyotrophic lateral sclerosis, non-arteritic optic neuropathy, and 

CC vitamin deficiency, intervertebral disc herniation, prion diseases such 

CC as Creutzfeldt-Jakob disease, carpal tunnel syndrome, peripheral 

CC neuropathies associated with various diseases, including but not limited 

CC to uremia, porphyria, hypoglycemia, Sjorgren Larsson syndrome, acute 

CC sensory neuropathy, chronic ataxic neuropathy, biliary cirrhosis, primary 

CC amyloidosis, obstructive lung diseases, acromegaly, malabsorption 

CC syndromes, polycythemia vera, immunoglobulin (Ig)A- and IgG gamma- 

CC pathies, complications of various drugs (e.g., metronidazole) and toxins 

CC (e.g., alcohol or organophosphates ) , Charcot-Marie-Tooth disease, ataxia 

CC telangectasia, Friedreich's ataxia, amyloid polyneuropathies, 

CC adrenomyeloneuropathy, Giant axonal neuropathy, Ref sum's disease, Fabry's 

CC disease, or lipoproteinemia . The present sequence represents the rat 

CC neurotransmitter receptor protein Nogo-A, an example of NS-specific 

CC antigen 

XX 

SQ Sequence 1163 AA; 

Query Match 72.5%; Score 4296.5; DB 5; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 6.2e-214; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19; 

QY 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

111 = 111 I I I I I I I I I I I I I I I I I I I MINI I I I I I : I I :: I I I I I I I I I I I | | 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

QY 59 P AAGL S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP 118 

I I I I I I I I II I M I M : I I : I I I I I I I I I I | M I I I I I I I I ! I : | | : I 
Db 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAP PAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

HI I I I I I I I I I I II M I I I I I II I I I I I I I I I I I I I I I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

HI : I I M I I II I I I : I : I I I M I I I I I Ill 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

I M I II I I I I I I I I I I : I I I : : I : : I I I I I : I : I :: I I I I I M I I I I 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

QY 285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

MINI III ll|::| I :M:||::||:|: || : || || | ||| 
Db 287 EMGSSFKGSPKGESAI LVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

III II I I II ::: I I I : I I I I I I I I M I : I I I I I : I | | : | M : | : | 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 



Qy 4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIAT 4 63 

Nihil Mill: I I I I I : I I I I I I II : I I I I I M I I I : I II I 
Db 397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

QY 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I I I I I I I I I I I I I I I I I I I : I I I I : I I I II I I I I I I II I I I I II I I I I I : I 
Db 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

III I : I I I I I I I I I I I I I I I I I I I M II I I I I I I I I : I I I I I I I : I I I I I I I I I I 
Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 57 4 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

I I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I : I I I II I I I I : I : I I I I I I I I 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

I M I I I I : I : I I I II Mill: M I : I I I II I I II II II I II M M I I I : I I I I : 
Db 635 PPYEEAMNV7VLKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

IIMII I : I I : I : I I II II I I : I I I I I II II M II : M I Ml I I I : II I II I I I 
Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : : : I : I II I I I I II I I I I : : I MM : : : II : II II I I II I I : I 

Db 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

M I I M I I II I : M I : I II I II M I I II I II I I MM II I I I M I I I II I I I 
Db 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

Qy 8 83 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

Mill I I I I I I II I II I I II M : : : I I M I M I : I I : I : I I I 
Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I M I I M I I I M I I II II M I I I I II II: MM I II I I I I I II I I M I 
Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSA/TAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

I II I I I M I II I I II II I I I II I I I II I I I I M II I I M II I I II I II M M I M I I I II 

Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I II M I I I I I I II I M II I I I I I I I M 
Db 1048 L E S E VAI SEE L VQK Y S N S AL GH VN S T I KE L RRL FL VD D LVD S L K FAVLMWVFT YVGAL FN 1107 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I II I I II I II I M I II I I I I I II I I I I II II M I I M I II I M I I I I I I I I : 
Db 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 

RESULT 14 
AAY71557 

ID AAY71557 standard; protein; 1162 AA. 



XX 

AC AAY71557; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Rat Nogo A truncated protein used in the construction of mutant Nogo-A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 

KW central nervous system; neoplastic disease; antiproliferative; glioma; 

KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 

KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

KW hyperprolif erative disorder; benign dysprolif erative disorder; diagnosis; 

KW psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 

KW structural plasticity; screening; mutant; mutein. 

XX 

OS Rattus sp. 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US02 6160 . 
XX 

PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 

PT Nogo proteins and nucleic acids useful for treating neoplastic disorders 

PT of the central nervous system and inducing regeneration of neurons. 

XX 

PS Example; Page; 122pp; English. 
XX 

CC The patent relates to neurite growth inhibitor Nogo which is free of all 

CC central nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma , acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson's diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperprolif erative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. The present sequence 

CC is a truncated form of rat Nogo A protein shown in AAY71310, which is 

CC used in the construction of mutant Nogo-A. Nogo-A is composed of His- 

CC tag/T7-tag/vector/Nogo-A sequence aa 1-1162. Nogo A deletion mutants were 



CC used for mapping the inhibitory sites of Nogo protein. Major inhibitory 

CC region was identified in the Nogo A sequence from amino acids 172-974, 

CC particularly amino acids 542-722. In addition, N-terminal region 1-171 

CC was found to be inhibitory to NIH 3T3 fibroblast spreading. Note: The 

CC present sequence is not given in the specification but is derived from 

CC rat Nogo A sequence shown in AAY71310. SEQ ID numbers 35-42 are referred 

CC in claim 32 and SEQ ID NO: 29 in disclosure of the specification. 

CC However, the specification does not include sequences for these SEQ ID 

CC numbers 
XX 

SQ Sequence 1162 AA; 

Query Match 72.5%; Score 4294.5; DB 3; Length 1162; 

Best Local Similarity 74.1%; Pred. No. 7.9e-214; 

Matches 885; Conservative 103; Mismatches 156; Indels 51; Gaps 19 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

111:111 I I I I I I I I II I I I I I I I I I I I I I I I I I I I I : I I :: I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 PAAGLS7\APVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

I I I I I I I I II III 111:11 : I I I I I I II I I I I I I I I I I I I I M : I i : I 

Db 61 PAAGLSAAAVP — PAAAAPLLDFSSDSVPPAPRGPLPAAPPAAPERQPSWERSPAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

III I I I I I I I I II I I I I I I I I I I I I I I I I I I | | | | | | | | M II I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

Ml : M I I I I I I I I I : I : I I I I I I I I I I I I ! M I I I 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

MM I II I I I I I II I I : | | | : : | : : | | M | : | : | : : I I I M I M I II 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 285 EMGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDE 344 

II I I I I II I I I I :: I I : I I : M : : II : I : M : II III Ml 

Db 287 EMGS S FKGS PKGESAI LVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 4 03 

I I I II I I I I ::: I I M I I I II I II I I I : M M I: I I I : I I I : I : I 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

M M : II Mill: MM M I I I I II I I : M I II II I I I Mill 
Db 397 SKVDRKCIEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I II I M I MM I II II I I I : I I II : I II M II II I I II II II MUM Ml 

Db 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

M I M I II II I I II II I || I I I II I I I II I II II I I : || I I I || : || | | || MM 
Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 



Qy 



584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 



Db 


575 


PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 


634 


Qy 


643 


PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 

1 1 1 1 1 1 1 : 1 : 1 1 1 II Mill: 1 1 1 • 1 1 II 1 II 1 1 1 1 1 I If 1 1 I I 1 I I I * I I I I • 
1 1 1 1 1 1 1 1 • 1 1 i ii i i i i i • i i i - i i i i i i i i i i i i i i i i i i i i i l I • l f | l • 

PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 


702 


Db 


635 


694 


Qy 


703 


YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 

111:11 1 : 1 1 : 1 : 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 1 f 1 1 1 1 1 • 1 1 1 1 • l I I I • i i i I i i i i 
1 1 1 • 1 1 f * i i • i • ii ii ii i i • i ii i i i ii i i i i i . i i i f . | l l l • | | | | 1 1 1 1 

YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 


7 62 


Db 


695 


753 


Qy 


763 


SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 

: : : : : : 1 : 1 1 1 1 I 1 1 1 1 1 1 1 1 - • 1 -Ml • • * i i - i i i i i i i i i i . i 
i • i i i i i i i i i ii ii • • I -ill ... 1 1 > 1 1 1 1 1 Mill " 1 

TVAQHK-EERLSAS PQELGKPYLESFQPNLHSTKDA-ASNDI PTLTKKEKI SLQMEEFNT 


822 


Db 


754 


811 


Qy 


823 


AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 

1:111111 III - 1 * 1 * 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 -1*1 II M I > I i i i i i i i i 
i ■ i i i i i i iii • i • i • i i I I I I I I I I I I I I I l I « I - I ll 1 1 1 - 1 1 M II 1 1 1 

AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 


882 


Db 


812 


870 


Qy 


883 


SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 

1 1 1 1 1 II MM Ml III III II • • • M-ll-l I . ii . i .mi 
i i i i i m i i i i iii iii iii ii ... i i . i i . | | . || : | : | | [ 

SEIANIQSGADSLPCLELPCDLSFKNIYPK— DEVHVSDEFSENRSSVSKASISPSNVSA 


942 


Db 


871 


928 


Qy 


943 


LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 

1 1 1 : MM 1 1 M M 1 1 1 1 1 1 M 1 1 II 1 II* 11*1 I 1 II I 1 1 i i i i i i m 
1 1 ■ * i i i i i i i i i i i i i i i i i i i i i i i ii* i i . i I I I I f l l l l ll | | If 

L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 


1002 


Db 


929 


987 


Qy 

Db 


1003 
988 


TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 

1 M 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 i 1 1 1 1 1 1 1 1 I I I I I I ll ll I I I i i i i i i i i i I i i i i i i i 
■ ' i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i | | II 1 1 1 1 1 1 i 1 1 1 1 1 1 M M 1 M 1 1 

TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 


1062 
1047 


Qy 

Db 


1063 
1048 


LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 
1 1 1 1 1 1 M 1 II II II 1 II II 1 II 1 M 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 | | M 1 1 1 1 1 II 1 II 
LESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFN 


1122 
1107 


Qy 

Db 


1123 
1108 


GLTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKA 1177 
1 1 1 1 1 1 1 1 M II II : II 1 1 1 II 1 1 II II 1 1 II 1 1 : \ II II 1 1 II II 1 1 1 1 II 1 1 
GLTLLI LALI SLFS I PVIYERHQVQIDHYLGLANKSWDAMAKIQAKI PGLKRKA 1162 





RESULT 15 
AAY71384 

ID AAY71384 standard; protein; 1163 AA. 
XX 

AC AAY71384; 
XX 

DT 02-NOV-2000 (first entry) 
XX 

DE Alternative version of rat neurite growth inhibitor Nogo A. 
XX 

KW Rat; neurite growth inhibitor; Nogo A; neural cell; myelin; CNS; 
KW central nervous system; neoplastic disease; antiproliferative; glioma; 
KW antisense gene therapy; neuroblastoma; menagioma; retinoblastoma; 
KW degenerative nerve disease; Alzheimer's disease; Parkinson's disease; 

hyperproliferative disorder; benign dysprolif erative disorder; diagnosis; 
psoriasis; tissue hypertrophy; neuronal regeneration; treatment; 
KW structural plasticity; screening. 



KW 
KW 



XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT Inhibitory-site 1. .171 

FT /note= "Inhibits NIH 3T3 fibroblast spreading" 

FT Modified-site 30 

FT /note= "Casein kinase II site" 

FT Region 31. .58 

FT /note= "Acidic region" 

FT Region 172. .259 

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Misc-dif ference 223 

FT /label= Unknown 

FT /note= "There is Leu at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 233 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 242. .244 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 291 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 295 

FT /note- "Protein kinase C (PKC) site" 

FT Misc-dif ference 404 

FT /note= "There is lie at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 436 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 468. .470 

FT /note= "Asn is N-glycosylated" 

FT Misc-dif ference 469 

FT /label= Unknown 

FT /note= "There is Lys at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 484 

FT /note- "Protein kinase C (PKC) site" 

FT Modified-site 488 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 502 

FT /note- "Casein kinase II site" 

FT Inhibitory-site 542. .722 

FT Modified-site 576 

FT /note= "Casein kinase II site" 

FT Peptide 623. .640 

FT /note= "used as immunogen to generate antibody AS 472" 

FT Modified-site 626 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-dif ference 661 

FT /note= "There is Asn at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 694. .696 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 715 

FT /note= "Casein kinase II site" 

FT Peptide 762. .1163 



FT /note= "used as immunogen to generate antibody AS Bruna" 

FT Modified-site 784 

FT /note= "Protein kinase C (PKC) site" 

FT Misc-dif ference 820 

FT /note= "There is Leu at this position in the sequence 

FT shown in AAY71310" 

FT Modified-site 821 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 850 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 855 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 863 

FT /note= "Casein kinase II site" 

FT Modified-site 868 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 893 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 912. .914 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 925. .927 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 954 

FT /note= "PKC and casein kinase II sites" 

FT Modified-site 956 

FT /note= "PKC and casein kinase II sites" 

FT Region 975. .1162 

FT /note= "This region is not essential for inhibitory 

FT activity" 

FT Region 976. .1163 

FT /note= "C-terminal common region found in Nogo A, B and C 

FT isoforms" 

FT Domain 98 8. .1023 

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region" 

FT Modified-site 1024 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1071. .1073 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1073 

FT /note= "Protein kinase C (PKC) site" 

FT Modified-site 1089 

FT /note= "Protein kinase C (PKC) site" 

FT Domain 1090. .1125 

FT /label= Transmembrane_domain 

FT /note= "C-terminal hydrophobic region" 

FT Modified-site 1141. .1143 

FT /note= "Asn is N-glycosylated" 

FT Modified-site 1143 

FT /note= "Protein kinase C (PKC) site" 
XX 

PN WO200031235-A2. 
XX 

PD 02-JUN-2000. 
XX 

PF 05-NOV-1999; 99WO-US026160 . 
XX 



Nogo proteins and nucleic acids useful for treating neoplastic disorders 
of the central nervous system and inducing regeneration of neurons. 



PR 06-NOV-1998; 98US-0107446P . 
XX 

PA (SCHW/) SCHWAB M E. 

PA (CHEN/) CHEN M S. 
XX 

PI Schwab ME, Chen MS; 
XX 

DR WPI; 2000-400052/34. 
XX 
PT 
PT 
XX 

PS Claim 3; Fig 13; 122pp; English. 
XX 

CC The present sequence is an alternative version of rat Nogo A protein 

CC which is a potent neural cell growth inhibitor and is free of all central 

CC nervous system (CNS) myelin material with which it is natively 

CC associated. Nogo proteins and fragments displaying neurite growth 

CC inhibitory activity are used in the treatment of neoplastic disease of 

CC the CNS e.g. glioma, glioblastoma, medulloblastoma, craniopharyngioma, 

CC ependyoma, pinealoma, haemangioblastoma, acoustic neuroma, 

CC oligodendroglioma, menagioma, neuroblastoma or retinoblastoma and 

CC degenerative nerve diseases e.g. Alzheimer's and Parkinson f s diseases. 

CC Therapeutics which promote Nogo activity can be used to treat or prevent 

CC hyperproliferative or benign dysprolif erative disorders e.g. psoriasis 

CC and tissue hypertrophy. Ribozymes or antisense Nogo nucleic acids can be 

CC used to inhibit production of Nogo protein to induce regeneration of 

CC neurons or to promote structural plasticity of the CNS in disorders where 

CC neurite growth, regeneration or maintenance are deficient or desired. The 

CC animal models can be used in diagnostic and screening methods for 

CC predisposition to disorders and to screen for or test molecules which can 

CC treat or prevent disorders or diseases of the CNS. Note: The present 

CC sequence is an alternative version of the Nogo A sequence shown in Fig. 

CC 2A (see AAY71310) . SEQ ID numbers 35-42 are referred in claim 32 and SEQ 

CC ID NO: 29 in disclosure of the specification. However the specification 

CC does not include sequences for these SEQ ID numbers 

XX 

SQ Sequence 1163 AA; 

Query Match 72.4%; Score 4286.5; DB 3; Length 1163; 

Best Local Similarity 74.1%; Pred. No. 2.1e-213; 

Matches 886; Conservative 104; Mismatches 155; Indels 51; Gaps 19; 

1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 
I I I : I I I I I I I I I I I I I I I M I I I I I I I I I I | I I I I I : I I :: I I I I I I | | | | | | | 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

QY 59 P AAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP 118 

I I t I I I I I II I I I I I I : I I : I | | | | | | || | | | | || | | | | | | | | : | | ; | 
Db 61 P AAG L S AAAVP P AAAAP LLDFSSDSVP PAP RG P L P AAP P AAP ERQPSWERS P AA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKR 178 

HI I I I I I I I I I I I I I I I I I I I | || I I I I I I I I I I I I I 

116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 



Db 



Qy 179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

HI : I I I II II I I I I : I : I I I I I II II II Ill 



Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETT^ASXPSL 22 6 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : I II I I II I | I I 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 

Qy 2 85 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 

I I I I I I I I I I I I : : I I : I I : I I : : I I : I : I I : I I I I I | | | 

Db 287 EMGS S FKGS PKGESAI LVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG KEDR 340 

Qy 345 WS SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDSKEDS - DMLAAGGKI ESMLE 4 03 

I I I I I I I I I ::: I I I : I I I I I I I I I I I : I I I I I : I I I : I I I : | : | 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 404 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 4 63 

Nihil Mill: I I I I I : I I M I I I I : I I I I I I I M I : I I I I 
Db 397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I I I I I M I I I I I I I I I I I I I : I I I I : II I II MINI II I I I I MINI I : I 
Db 456 NTFPLLEDHTSENXTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

Ml I : I II I I I I I I I I I I I I I I M I I I I I I I I I I II : I I I I I II : I I I I I I I I I I 
Db 515 VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

I I I M : M I I I I I I I I I II I I I I I I : I II I I I I : I I I Mill I : I : I I I I I I I I 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLS7VEPAPDFSD 702 

I M II I I : I : I I I II Mill: II : I II II II II II I I I II M I II I I : II I I : 
Db 635 PPYEEAMNVALKALGTKEGIKEPESFMAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

I M : I I I : II : I : I II II I I I : I I I II I I I I II II : I M Ml M I : I II I I I I I 
Db 695 YSEIAKFEKSVPEH7VELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : : M : II I I I I I I I I I II : : I MM : : : I I : M I I I Mill : I 

Db 754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 8 82 

I M II I I I III : I : I : II I I I I II I I I II I I I I MM II I || : I I I II II I I 
Db 812 AIYSNDDLSSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PK1AKEYTDLEVSDK 870 

Qy 883 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

Mill I I I I I I I II I II I M II : : : I I : I I : | | : I | : I : II I 
Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I I : M II I I II II I I II I I M I I I II II: MM I I II I II I I I I I II I 
Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

M I I II I II M I I I I II I I II I I I I II I I I I I I I I II II I II I I M I II I II I I I I I II I 
Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 



Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVXMWVFTYVGALFN 1122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 104 8 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGIANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I II I I I I I I I I I I : I I I I I I II I I I I I I I I I I I : I I I I I I I I I I I I I II I I I I : 
Db 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 



Search completed: September 29, 2004, 18:14:00 
Job time : 108.384 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: September 29, 2004, 18:09:33 ; Search time 29.2441 Seconds 

(without alignments) 
2079.581 Million cell updates/sec 

Title: US-09-830-972-29 
Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB .pep : * 
/ cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 
/ cgn2_6/ptodata/2/iaa/6A_COMB.pep : * 
/cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 
/cgn2_6/ptodata/2/iaa/PCTUS_COMB .pep : * 
/ cgn2_6/ptodata/2/iaa/backf ilesl .pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-08-700-607-1 

; Sequence 1, Application US/08700607 
; Patent No. 5858708 

GENERAL INFORMATION: 

APPLICANT: Bandinan, Olga 
APPLICANT: Au- Young, Janice 
; APPLICANT: Goli, Surya K. 

APPLICANT: Hillman, Jennifer L. 
; TITLE OF INVENTION : TWO NOVEL HUMAN NSP-LIKE PROTEINS 

NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
; STATE: CA 

COUNTRY: U.S. 
; ZIP: 94304 



COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/7 00 , 607 
; FILING DATE: Filed Herewith 

ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
; REFERENCE/DOCKET NUMBER: PF-0114 US 

; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 199 amino acids 

TYPE: amino acid 
; STRANDEDNESS: single 

TOPOLOGY: linear 
; MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

LIBRARY: 
; CLONE: Consensus 

US-08-700-607-1 



Query Match 15.7%; Score 931; DB 2; Length 199; 

Best Local Similarity 99.0%; Pred. No. 8e-50; 

Matches 189; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 
1 M I I I I I I I I I I I I I I I I I I I I I M I II I I I I I II I I I I I II I I I I I I II I II I I I I 
Db 9 KDKVVTiLLYWRDI KKTGVVFGASLFLLLSLTVFS I VS WAYI ALALLSVT I S FRI YKGVI 68 

QY 104 8 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I | | | | | | | | | | M I I II I I I I I I I I II I 
Db 69 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 128 

Qy 1108 AVLMWFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

I I M I I I I I I I II II || I I I | | | | | | | | | | | | || | | | | | || | | | | | || | | | | | | | | | | | | 
Db 129 AVLMWVFT YVGALFNGLTLLI LALI S LFS VPVI YERHQAQI DHYLGLANKNVKDAMAKI Q 188 

Qy 1168 AKI PGLKRKAE 1178 

I I I I I I I I I I I 
Db 189 AKI PGLKRKAE 199 



RESULT 2 
US-08-700-607-5 

Sequence 5, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Bandman, Olga 
Au-Young, Janice 
Goli, Surya K. 
Hillman, Jennifer L. 



; TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 

NUMBER OF SEQUENCES : 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
; STREET: 3174 Porter Drive 

CITY: Palo Alto 
; STATE: CA 

COUNTRY: U.S. 
; ZIP : 94304 

COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE : FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/7 00 , 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 77 6 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

LIBRARY: GenBank 
CLONE: 307307 
US-08-700-607-5 



Query Match 13.3%; Score 787.5; DB 2; Length 776; 

Best Local Similarity 31.9%; Pred. No. 3.6e-40; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25 

> EAT P S P VL P D I VMEAP LNS A VPSAGASVIQPSSSPLE— ASSVNY 632 

II I III: : II: I: I : : ||: | : | 



-KhhPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

: I I : : : I I : : I : I I : : I I | : : : I 



LPYISIACDL IKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

HI M : | | ||:: 



Qy 


588 


Db 


141 


Qy 


633 


Db 


201 


Qy 


676 


Db 


260 


Qy 


723 


Db 


310 



APYID— DLSEEQRRAPQITTPVKITLTEIEPSVE TTTQEKTPEKQDICLK 309 

iSPDSEPVDLFS— DDS IPDVPQKQ DETVMLVKESLTETSF 761 

Ml : ll I M I : I | | : : | | : 
•SPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 



Qy 



762 ESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELS 821 



: II I : =1 III ||::: 

Db 364 -KGLSYETAENPRPVGQLADRP EVKARSGPPTIPSPLDHEA 403 



QV 822 TAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFP TLISSKTDSFS 868 

:: I I I I I : : I I : | | | 

404 SSAESGD SEIELVSEDPMAAEDALPSGYVSFGHVGGPPPSP 444 



Db 



Qy 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD' 921 

: I I : I: : I : : I : I I I III 

445 ASPSIQYSILREEREAELDSELIIESCDAS SAS EESPKREQDSPPMKPSALD 496 



Db 



Qy 922 DF SKNGSATSKVLL LP PDVSALGHTQAEI ES IVKP 956 

I'M I III II I I : : I 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

Qy 957 KVLEKEAEKKLPSDTEKEDRSPSA IFSADLGKTSWDLLYWRDIK 1001 

: ' : : : I i : I : | || : I I I M | | | | 
Db 55 2 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIK 599 

Qy 1002 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1061 

= 11 = 111= I II Ml I I = I t I 11:111 || | || |:||:||:||MM:| 

Db 600 QTGIVFGSFLLLLFSLTQFSVVSVVAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKA 659 

Qy 1062 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALF 1121 

HI I : : I • I : I I I : = : I I I : I I I I I I I I I I I I I I I I I I II II : I II I I I I 

Db 660 YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 

Qy 1122 NGLTLLI LALI SLFSVPVI YERHQAQI DH YLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

N I I I I : : I : : I : I : : M : | : I I | I I I I I I I : : : I I I I I I I I I I I | | 
Db 720 NGLTLLLMAWSMFTLPWYVKHQAQI DQ YLGLVRTHINAWAKIQAKIPGAKRHAE 776 



RESULT 3 
US-08-700-607-6 

Sequence 6, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700 , 607 



; FILING DATE : Filed Herewith 

ATTORNEY/ AGENT INFORMATION: 

NAME: Billings, Lucy J. 
; REGISTRATION NUMBER: 36,749 

REFERENCE/ DOCKET NUMBER: PF-0114 US 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 415-855-0555 
; TELEFAX: 415-845-4166 

; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 356 amino acids 

TYPE: amino acid 
; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

LIBRARY: GenBank 

CLONE: 307309 
US-08-700-607-6 

Query Match 11.9%; Score 705; DB 2; Length 356; 

Best Local Similarity 42.6%; Pred. No. 1.4e-35; 

Matches 168; Conservative 55; Mismatches 86; Indels 85; Gaps 11; 

QY B03 EVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISS 862 

: I I : : : I I : : I I I : I : II II I : : I : : : 

Db 30 QYSILREEREAELDSELI IESCDASSASEESPKRE QDSPPMK PSALDA 77 

QY 863 KTDSFSKLAREYTDLEVSHKSEIANAPDGAGSL PCTELPHDLSLKNIQPKVEEKI SF 919 

111= - II III Ml 
Db 78 IREETGVRAEERAPSRRGLAEPGSFLDYPSTE PQPGPE 115 

Qy 920 SDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKKAEKKLPSDTEKEDRSPS 979 

Ml II I I: : I : | : ::||: 

Db 116 LPPGDGAL EPETPMLP RKPEEDSSSNQSPA 145 

Qy 980 A IFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVS 1024 

I =1 II : I I I I I I I II : M : I II : I I I I I I M : | | 
Db 146 ATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWS 202 

Qy 1025 VTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS7ULGH 1084 

I MM I I M MM I I II I M I M I M M II I : I I I I |: :|:| :|||:: : 
Db 203 WAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFY 2 62 

Qy 1085 WCTIKELRRLFLVDDLVI)SLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERH 1144 

M I M II II I II I I II II I I I I II I I : II I I I | | | | | | M : : I : : I : I : : I I : I : I 
Db 263 WSTLKELRRLFLVQDLVDSLKFAVLMWLLTWGALFNGLTLLLMAWSMFTLPWYVKH 322 

Qy 1145 QAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I M :: : I M I I I I I I M II 

Db 323 QAQ I D Q Y L G L VRT H I NAWAK I Q AK I P GAK RHAE 356 



RESULT 4 
US-08-700-607-7 

; Sequence 7, Application US/08700607 
; Patent No. 5858708 



GENERAL INFORMATION: 

APPLICANT: Bandman, Olga 
APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/7 00, 607 
FILING DATE: Filed Herewith 
ATTORNEY/AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-855-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 208 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 
LIBRARY: GenBank 
CLONE: 307311 
US-08-700-607-7 



Query Match 11.6%; Score 688; DB 2; Length 208; 

Best Local Similarity 67.5%; Pred. No. 6.9e-35; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 



QY 


988 


KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 
1 : : 1 1 1 1 M 1 1 1 : 1 1 : 1 1 1 : 1 II III I I : I I | | | : | | | | | | | | | | | | | I : 
KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSVVAYLALAALSATISFRI YKSVL 


1047 


Db 


18 


77 


Qy 


1048 


QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 
M : 1 1 : 1 II 1 1 i : 1 1 1 1 1 : : I : I : I I I :: : | | 1 : 1 1 1 I I I I I | | I || 1 1 1 1 
QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 


1107 


Db 


78 


137 


Qv 


1108 


AVLMWFTWGAliFNGLTLLILALISLFSVPVIYERHQAQIDHYLGL7\NKNVKDAMAKIQ 
Mill: 1 1 II 1 1 1 1 1 1 1 1 1 : : 1 : : 1 : I : : I I : | : 1 || 1 1 1 1 1 1 1 : : =1111 
AVLMW L LT YVGAL FN G L T L L LMAWSM FT L P WYVKHQAQ I DQ Y L G L VRT H I NAWAK I Q 


1167 


Db 


138 


197 



Qy 



1168 AKI PGLKRKAE 1178 



Db 



198 AK I P GAKRH AE 208 



RESULT 5 
US-08-700-607-8 

; Sequence 8, Application US/08700607 

; Patent No. 5858708 

; GENERAL INFORMATION: 

; APPLICANT: Bandman, Olga 

APPLICANT: Au- Young, Janice 
APPLICANT: Goli, Surya K. 
; APPLICANT: Hillman, Jennifer L. 

TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
; STATE : CA 

COUNTRY: U.S. 
; ZIP: 94304 

COMPUTER READABLE FORM: 
; MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 
; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700, 607 
; FILING DATE: Filed Herewith 

ATTORNEY/ AGENT INFORMATION: 
; NAME: Billings, Lucy J. 

REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 415-855-0555 

TELEFAX: 415-845-4166 
; INFORMATION FOR SEQ ID NO: 8: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2 67 amino acids 
; TYPE: amino acid 

; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
; IMMEDIATE SOURCE: 

LIBRARY: GenBank 
CLONE: 281046 
US-08-700-607-8 

Query Match 11.3%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. l.le-33; 

Matches 124; Conservative 34; Mismatches 29; Indels 0; Gaps 0; 
QY 988 KT S WDLL YWRDI KKTGWFGAS LFLLLS LTVFS IVSVTAYI ALALLSVTI S FRI YKGVI 1047 



n 1 * " ' i i • I i • i i i • i ii | M [ | : | I | || : | || || | | | | | | | | | : 

9 KSQAI DLL YWRDI KQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSAT IS FRI YKSVL 68 




Db 



Qy 



104 8 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 



' 1 1 1 ' 1 1 1 • 1 1 • •"*! -ill— : II f • I I I I I I I I I I If II II I 

69 QAVQK.TDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 128 




Db 



Qy 




Db 



Db 



1168 AKIPGLK 1174 
I I I I I : 
189 AKIPGAR 195 



RESULT 6 

US-09-149-476-563 

; Sequence 563, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al. 

; TITLE OF INVENTION: 18 6 Human Secreted proteins 

FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149,476 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US9 8/ 04 4 93 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/038,621 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,626 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,334 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,336 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,163 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/047,600 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,615 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,597 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,502 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,633 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,583 

? EARLIER FILING DATE: 1997-05-23 

} EARLIER APPLICATION NUMBER: 60/047,617 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,618 

; EARLIER FILING DATE: 1997-05-23 

\ EARLIER APPLICATION NUMBER: 60/047,503 



EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 



EARLIER APPLICATION NUMBER: 60/056,889 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE : 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 



; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,578 

EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/043,576 
; EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/047,501 

EARLIER FILING DATE: 1997-05-23 

EARLIER APPLICATION NUMBER: 60/043,670 

EARLIER FILING DATE: 1997-04-11 
; EARLIER APPLICATION NUMBER: 60/056,632 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,664 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,876 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,881 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,909 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,875 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,862 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,887 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/056,908 

EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/048,964 
; EARLIER FILING DATE: 1997-06-06 
; EARLIER APPLICATION NUMBER: 60/057,650 
; EARLIER FILING DATE: 1997-09-05 
; EARLIER APPLICATION NUMBER: 60/056,884 
; EARLIER FILING DATE: 1997-08-22 
; EARLIER APPLICATION NUMBER: 60/057,669 
; EARLIER FILING DATE: 1997-09-05 
; EARLIER APPLICATION NUMBER: 60/049,610 
; EARLIER FILING DATE: 1997-06-13 
; EARLIER APPLICATION NUMBER: 60/061,060 

EARLIER FILING DATE: 1997-10-02 

Query Match 9.1%; Score 541.5; DB 4; Length 168; 

Best Local Similarity 60.5%; Pred. No. 5e-26; 

Matches 101; Conservative 34; Mismatches 31; Indels 1; Gaps 1; 
Qy 1013 LLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEE 1072 



11 " 1 • i i i i i i i i i i i i • l l I • I I I I I • I I I • : : * I I 

1 MLLSLAAFSVISWSYLILALLSVTISFRIYKSVIQAVQKSEEGHPFKAYLDVDITLSSE 60 





Db 



Qy 



1073 LVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALI 1132 



• 1 • ■ ■ * * ' * t i i " i * i i i i i i i f i ii- I I I I I • I I I - I I I I I I : 

61 AFHNYMNAAMVHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAEL 120 




Db 



Qy 1133 SLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGL-KRKAE 117 8 

: I I I I : : I I : : : I I I I I : I : I I : I I I I I : I I : I : I I I 

Db 121 LIFSVPIVYEKYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 167 



RESULT 7 
US-08-700-607-3 

Sequence 3, Application US/08700607 
Patent No. 5858708 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Bandman, Olga 
Au- Young, Janice 
Goli, Surya K. 
Hillman, Jennifer L. 
TITLE OF INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 
STREET: 3174 Porter Drive 
CITY: Palo Alto 
STATE: CA 
COUNTRY: U.S. 
ZIP : 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/700,607 
FILING DATE: Filed Herewith 
ATTORNEY/ AGENT INFORMATION: 
NAME: Billings, Lucy J. 
REGISTRATION NUMBER: 36,749 
REFERENCE/DOCKET NUMBER: PF-0114 US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-8 55-0555 
TELEFAX: 415-845-4166 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
IMMEDIATE SOURCE: 

LIBRARY: THP1NOB01 
CLONE: 31870 
US-08-700-607-3 

Query Match 8.7%; Score 513; DB 2; Length 241;. 

Best Local Similarity 47.7%; Pred. No. 4.7e-24; 

Matches 102; Conservative 37; Mismatches 55; Indels 20; Gaps 1; 

Qy 963 AEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI 1022 

M I I : : : I I I : I I I : I I I I I I I : I : 1 I I I I I : 



Db 



2 0 AEPSAPGGGGSPGACPALGTKSCSSSCAVHDLIXWRDVKKTGFVFGTTLIMLLSLAAFSV 7 9 



Qy 


1023 


Db 


80 


Qy 


1083 


Db 


140 


Qy 


1143 


Db 


200 



: I I : I : I I I I I I I I I I I I I I I I I I : I I I : I I I I I : I I 



I : I I I I I : I I I I II I II II: I I I I I : II I : I I I I | | : : | | | : : | 



I : : I I : : I 
-KVPSKTPWNRQK 213 



RESULT 8 

US-09-621-976-4600 

Sequence 4600, Application US/09621976 
Patent No. 6639063 
GENERAL INFORMATION: 
APPLICANT: Dumas Milne Edwards, J.B. 
APPLICANT: Jobert, S. 
APPLICANT: Giordano, J.Y. 

TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET . 054 PR2 
CURRENT APPLICATION NUMBER: US/09/621,976 
CURRENT FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 19335 
SOFTWARE: Patent.pm 
SEQ ID NO 4600 
LENGTH: 75 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: 58 

OTHER INFORMATION: Xaa = His, Pro 
NAME/ KEY: UNSURE 
LOCATION: 28 

OTHER INFORMATION: Xaa = Met,Val 
NAME/ KEY: UNSURE 
LOCATION: 19 

OTHER INFORMATION: Xaa = Pro, Gin 
NAME/ KEY: UNSURE 
LOCATION: 53 

OTHER INFORMATION: Xaa = Ser,Tyr 
US-09-621-976-4600 

Query Match 5.8%; Score 344.5; DB 4; Length 75; 

Best Local Similarity 82.6%; Pred. No. 1.9e-14; 

Matches 71; Conservative 0; Mismatches 4; Indels 11; Gaps 1; 

QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I | | | | | | | | | || | || 

Db 1 MEDLDQSPLVSSSDSPPRXQPAFKYQFXREPEDEE EDLEELEVLERKPA 49 

Qy 61 AGL S AAP VPTAPAAGAP LMD FGND FV 8 6 



Ill 1 1 1 1 1 1 1 II I II I II 1 1 1 1 1 1 

Dk 50 AGLXAAPVXTAPAAGAPLMDFGNDFV 75 

RESULT 9 

US-09-621-976-4601 

Sequence 4601, Application US/09621976 
Patent No. 6639063 
GENERAL INFORMATION: 
APPLICANT: Dumas Milne Edwards, J.B. 
APPLICANT: Jobert, S. 
APPLICANT: Giordano, J.Y. 

TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET . 054PR2 
CURRENT APPLICATION NUMBER: US/09/621 , 976 
CURRENT FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 19335 
SOFTWARE: Patent. pm 
SEQ ID NO 4601 
LENGTH: 75 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/ KEY: UNSURE 
LOCATION: 58 

OTHER INFORMATION: Xaa - His, Pro 
NAME/KEY: UNSURE 
LOCATION: 2 8 

OTHER INFORMATION: Xaa = Met,Val 
NAME/KEY: UNSURE 
LOCATION: 19 

OTHER INFORMATION: Xaa = Pro, Gin 
NAME/ KEY : UNSURE 
LOCATION: 53 

OTHER INFORMATION: Xaa - Ser,Tyr 
US-09-621-976-4601 

Query Match 5.8%; Score 344.5; DB 4; Length 75; 

Best Local Similarity 82.6%; Pred. No. 1.9e-14; 

Matches 71; Conservative 0; Mismatches 4; Indels 11; Gaps ] 

1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I II II I II I I I I I II I II I I I I I 

1 MEDLDQSPLVSSSDSPPRXQPAFKYQFXREPEDEE EDLEELEVLERKPA 4 9 



Db 



Qy 61 AGL S AAP VPT APAAGAP LMD FGND FV 8 6 

Ml I I I I I I I I I I I I I I I I II I I I 
50 AGLXAAPVXTAPAAGAPLMDFGNDFV 75 



RESULT 10 
US-08-714-741-32 

Sequence 32, Application US/08714741 
Patent No. 6500613 
GENERAL INFORMATION: 

APPLICANT: Briles, David E. 
APPLICANT: McDaniel, Larry S. 



APPLICANT: Swiatlo, Edwin 
APPLICANT: Yother, Janet 
APPLICANT: Crain, Marilyn J. 
APPLICANT: Hollingshead, Susan 
APPLICANT: Tart, Rebecca 
APPLICANT: Brooks-Walter, Alexis 

TITLE OF INVENTION: PNEUMOCOCCAL GENES, PORTIONS THEREOF, 

TITLE OF INVENTION: EXPRESSION PRODUCTS THEREFROM, AND USES OF SUCH GENES 
TITLE OF INVENTION: PORTIONS AND PRODUCTS 
NUMBER OF SEQUENCES: 4 7 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Curtis, Morris & Safford, P.C. 
STREET: 530 Fifth Avenue 
CITY: New York 
STATE: New York 
COUNTRY: U.S. 
ZIP: 10036 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/7 14 , 74 1 
FILING DATE: 16-SEP-1996 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Frommer Esq., William S. 
REGISTRATION NUMBER: 25,506 
REFERENCE/DOCKET NUMBER: 454312-2460 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 840-3333 
TELEFAX: (212) 840-0712 
INFORMATION FOR SEQ ID NO: 32: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 8991 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: amino acid 
US-08-714-741-32 

Query Match 5.3%; Score 316; DB 4; Length 8991; 

Best Local Similarity 21.4%; Pred. No. 9.9e-10; 

Matches 248; Conservative 138; Mismatches 479; Indels 296; Gaps 52; 

Q y 6 QSPLVSSSDSP-PRPQPAFKYQF-VREPEDEEEEEEEEEEDEDEDLEE — LEVLERKPAA 61 

::| ; l 1=1 II I : :h h :::: III || : :::| 

Db 7193 EAPAEQPKPAPAPQPAPAPKPEKPAEQPKPEKTDDQQAEEDYARRSEEEYNRLTQQQPPK 7252 

Qy 62 GLSAAPVP TAPAAG APLMDF 81 

I ' I I I | II 

Db 7253 AEKPAPAPKTGWKQENGMWYFYNTDGSMGEQAGQYRAAAEGDLAAKQAELEKTEADLKKA 7312 



QY 82 GNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPED DE 138 

Is Ml M I II II |:| Ml II :: :| :| 

Db 7313 VNEPEKPAPAPETPA— PEAPAEQPK— PAPAPQPAPAPKPEKPAEQPKAEKTDDQQAEE 7368 



^ 139 PPAR PPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXX 189 

N I I : I I | | | | | I I : | : 

Db 7369 DYARRSEEEYNRLTQQQPPKAEKPAPAPKPEQPAPAPKNSKGEQAEQYRSAAGGDLAAKQ 7428 

®y 190 MDLKEQPGNTISAGQEDFPSVLLET-AASXPSLSPLSAASFKEHEYLGNLSTV 244 

:: Ml: : : : I : | | | I : | | : 
Db 742 9 VELEKTEADLKK AVN E P E K P AP A P E T PAP EAP AEQ P K P AP AP Q 7471 

^ 245 LPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVAN 304 

I I : I I : | | : | Mil:: : MM 

Db 7472 -PAPAPKPEKPAEQPK AEKPADQQAEEDYDRRSEEEYNRL — TQQQPPKAEKPAPAPQ 7526 

®y 305 PREEI 1 VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVVS SEKAKDS FNE 357 

1 : : II:: III M I : | : | : | 

Db 7527 PEQPAPAPKSLKEIDESDSEDYVKEGFRAPLQSELDAKQAKLSKLEEL SDKIDE 7580 

Qy 358 KRV AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQ 417 

: I : : : I M II M : : I : | | Ml 
Db 7581 LDAEI-AKLEKDVEDFK XSDGEQAGQYLAAAEEDLIAKKA ELEQ 7623 

QY 418 TNHEKDSESSNDDTSFPS— TPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTS 474 

I I I : : :: I: || III II | 
Db 7 62 4 T— EADLKKAVNEPGKPAPAPAPE T PAP EAP AEQ P K PAP 7660 

475 ENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYV — TTDNLTKVTEEWA-- 530 
I I I : I II IIMII : : : : | : : | 

Db 7661 ETPAPAPKPEKPAEQPKPEK PADQQAEEDYARRSEEEYNRLTQQQPAPA 7709 

Qy 531 NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II : I : I | | | | : : : | 

Db 7710 QKPEQPAKPEKPAEEPTQPEKDAEIAKLE-KNVEYFKKTDAEQTEQYLAAAEKDLADKKA 7768 

2y 585 SFEESEA TPSPV-LPDIVMEAPLNSAVPSAGASVIQPSSSPLEAS 628 

I : : M I I : I I : I I : I I : | : : : 

Db 77 69 ELEKTEADLKKAVNEPEKPAEETPAPAPKPEQPAEQPKPAPAPQP-APAPKPEKTDDQQA 7827 

Qy 629 SVNYESIKHEPENPPPYEEAMSVSLKVSGIKEE IKEPENINAALQETEAPYISIAC 684 

: I III:: || : | | | I I : I 

Db 7828 EEDYARRSEEEYNRLPQQQPPKAEKPAPAPKPEQPVPAEXPEN . PAPAPKPAXAP 78 81 

Qy 685 DLIKETKLSAEPAPDFSDYSEMAKVEQPV PDHSELVEDSS PDS 727 

: I : : : I I : ! : I M : || : I 

Db 7882 QPLKPEEPAEQPKPE KPEEPAGQPEPEKPDDQQAGEDYARRSGGEYNRFPQQ 7933 

Qy 728 EPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGG 781 

; l I I :: | | | | ||: 

Db 7934 QPPKAEKPAPAP KPEQPVPAPKTLL KKAKLAGAKSKAATKKAEL 7977 

Qy 782 KPYLESFKLSLDNTKDTLLP DEV STLSKK-EKI PLQMEELSTAVYSNDD 829 

: > N : 1:1 M I II: : MM I :| |: || : : | 

Db 7978 EPELEKAEAELENLLSTLDPEGKTQDELDKEAAEAELNKKVEALPNQVSELEEELSKLED 8037 

^ 830 LFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLARE YTDLEVSHKSEI 885 

l = l s I : I : MM I:| :| | 

Db 8038 NL— KDAETNNVEDY IKEGLEEAIATKQAELEKTPKELDAALNELGPDGDEEE 8088 



Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKI — S FS DDFS KN GS AT SKVLLLPPDVSAL 943 

I : III: Ml: : | ::: | 
Db 8 08 9 TPPPEAPAEQPKPEKPAE-ETPAPAPKPEKSADQQAEEDYARRSEEEYNRL 813 8 

^ 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRD— I 1000 

II: II Mil: I : : III : : | | 

Db 8139 TQQQ PPK AEKPAPAPAPKPEQPAPAPKSRGLATKKKLNLAEARIELLL 818 6 

Qy 1001 KKTGV VFGASLFLLLS 1016 

Ml: M I Ml 

Db 8187 KKLGLEPGLEKAGAGLGNLLS 8207 



RESULT 11 
US-09-976-594-726 

; Sequence 726, Application US/09976594 

; Patent No. 6673549 

; GENERAL INFORMATION: 

; APPLICANT: Furness, Michael 

; APPLICANT: Buchbinder , Jenny 

; TITLE OF INVENTION: GENES EXPRESSED IN C3A LIVER CELL CULTURES TREATED WITH 
STEROIDS 

; FILE REFERENCE: PA-0041 US 

; CURRENT APPLICATION NUMBER: US/ 09/976, 594 

; CURRENT FILING DATE: 2001-10-12 

; PRIOR APPLICATION NUMBER: 60/240,409 

PRIOR FILING DATE: 2000-10-12 
; NUMBER OF SEQ ID NOS : 1143 
; SOFTWARE: PERL Program 
; SEQ ID NO 72 6 
; LENGTH: 2468 
TYPE: PRT 

ORGANISM: Homo sapiens 
; FEATURE : 

NAME/KEY: mis cofeature 

OTHER INFORMATION: Incyte ID No. 6673549 4295277CD1 
US-09-976-594-726 



Query Match 5.0%; Score 296.5; DB 4; Length 2468; 

Best Local Similarity 21.5%; Pred. No. 2.5e-09; 

Matches 264; Conservative 148; Mismatches 459; Indels 355; Gaps 59; 



Qy 


2 


EDLDQSPLVSSS-DSPPRPQPAFKYQ FVREP EDEE 

II :: IMIII : : 1 : ::| | | | : 
EDGEEHVCVSASKHSPTEDEESAKAEADAYIREKRESVASGDDRAEEDMDEAIEKGEAEQ 


35 


Db 


956 


1015 


Qy 


36 


EEEEEEEEDEDEDLEELEV-LERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPL 
M 1 : 1 1 1 : M 1 1 1 : I 1 1 1 1 1 : | | | 

SEEEADEEDKAEDAREEEYEPEKMEAEDYVMAWDKAAEAGGAEEQYGFLTTPTKQLG — 


94 


Db 


1016 


1073 


Qy 


95 


PAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ 
: 1 MM 1 Mil || Ml 1 I : 


154 


Db 


1074 


AQSPGREPA— SSIHDETLPGGSESEATA SDEENREDQPEEFTATSGYT 


1120 


Qy 


155 


AEPVWT P PAPAPAAP P ST PAAP KRRGS S GAWXXXXKIMDLKEQPGNT I S AGQE D 

1 1 Ml :| : | | || 
QSTIEI SSEPTPMDEMSTP RDVMSDETNNEETESPSQEFVNITK 


209 


Db 


1121 


1164 



Qy 


210 


FPSVLLETAASXPS-LSPLSAASFKE HEYLGNLSTVLPTEGTLQENVSEAS 

• i i it ii ii i iii 


259 


Db 


1165 


: 1 1 1 1 : : : | | : I I : | : | | : | : : | : : 

YESSLYSQEYSKPADVTPLN — GFSEGSKTDATDGKDYNASASTISPPSSMEEDKFSRSA 


1222 


Qy 


260 


KEVSEKAKTLLIDRDLTEF SELEYSEMGS SFS 

i i i i i i i iii i ii 
IIIMIH 1 1 1 : : 1 : 1 1 

LRDAYCSEV — KASTTLDIKDSI SAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFS 


291 


Db 


1223 


1280 


Qy 


292 


VSP KAESAVTVANPR — EEIIVKN — KDEEEKL VSNNILHX 

= : 1 1 1 : 1 : 1 : 1 : : : : 1 : : I | : : | 

LT PNEI KVSAEAEVAPVS PEVTQEWEEHCAS PEDKTLEWS PSQS VTGSAGHT P YYQS P 


328 


Db 


1281 


1340 


Qy 


329 


QQELPTALTKLVKEDEWS S EKAKDS FNEKRVAVEAPMREEYADFK- P FERVWEVK 

Ml: : 1 1 1 : 1 : : i 1 : 1 1 1 | : | |:| 
TDEKSSHLPTEV — IEKPPAVPVSFEFSDAKDENERASVS PMDEPVPDSESPIEKVLSPL 


383 


Db 


1341 


1398 


Qy 


384 


DSKEDSDMLAAGGKIESNLESPCVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDR 

1 - - ill t i i ii 

1 : = = Ml: II 1 1 1 : : 1 1 : 
RS PPLIGSESAYESFLSA— DDKASGRGAESPFEEKSGKQGSPDQVSPVSE 


443 


Db 


1399 


1447 


Qy 


444 


SGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKK IEEKK A 

: 1 I : | : | : | | I I I : : : : | : | 
MTSTSLYQDKQEGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPT 


488 


Db 


1448 


1503 


Qy 


489 


QI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWA — N 

M : : 1 | | : : | | : | | : : | : | | : 
QIDVSQFGSFKEDTKMSISEGTVSDKSATP-- VDEGVAEDTYSHMEGVASVSTASVATSS 


531 


Db 


1504 


1561 


Qy 


532 


MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAAQLCP 
II Ml : : 1 : 1 : : : MM 1 1 : : 1 : : || 


584 


Db 


1562 


FPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMSIS 


1620 


Qy 


585 


S F E E S EAT P S P VL P - D I VMEAP LN S AVP S AGAS VI 


618 


Db 


1621 


II : M 1 1 : : 1 : MUM 
PPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEQSLAMDFSRQSPDH PTVGAGVL 


1677 


Qy 


619 


Q PSSSPLEASSVNYESIKHE — PENPPPY EEAMSVS-LKVSGIKEEIKE 

M 1 : 1 : 1 : 1 1 1 1 : I I I : : | 
HITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTSSAHT 


664 


Db 


1678 


1737 


Qy 


665 


PENINAALQETE AP YISIACDLIKE TKLSAEP A 

t i . i i i ii i t iii 


697 


Db 


1738 


1 1=111 II ||:::: III: : 
PSQIASPLQEDTLSDVAPPRDMSLYASLTSEKVQSLEGEKLSPKSDISPLTPRESSPLYS 


1797 


Qy 


698 


PDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS — 

MM: | | : || Ml Mil 

PTFSDSTSAVK-EKTATCHS SSSP PIDAASAEPYGFRASVLFDTMQHHLALNR 


734 


Db 


1798 


1849 


Qy 


735 


DDSIPDVP QKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 


779 


Db 


1850 


Ml: 1 1 : II 1 : I : I | : | | : 
DLSTPGLEKDSGGKTPGDFSYAYQKPEET TRSPDEEDYDYES YEKTTRTSDV 


1901 



Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

M I I : : : | : I Ml II I I : | | : 

Db 1902 GGY-YYEKIERTTKSPSDSGYSYE— TIGKTTKTP EDGDYSYE--I IEKTTRTP 1950 



Qy 



840 



ETETFSDSSPIEIIDEFPTLISSKTDS FSKLAREYTDLEVSHKSEIANAPDGAGSLP 8 96 

I : I I I I I I I : I I : : I : I I : 

EEGGYSYD 1 SEKTTS PPEVSGYSYEKTERSRRLLDDI SNGYDDS — — 1994 



Db 



1951 



QY 



897 



CTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE-IESIVK 955 

: I I : : I I I : : II III : I I I 

-EDGGHTLGDPSYSYETTEKITSFPESEGYSYETSTKTTRTPDTSTYCYETAEKITRTPQ 2 053 



Db 



1995 



Qy 



956 



PKVLEKE AEKKLPSDTEKE 974 

I I I I I I I : :: 

ASTYSYETSDLCYTAEKKSPSEARQD 2 07 9 



Db 



2054 



RESULT 12 
US-09-149-476-411 

; Sequence 411, Application US/09149476 

; Patent No. 6420526 

; GENERAL INFORMATION: 

; APPLICANT: Rosen et al . 

; TITLE OF INVENTION: 186 Human Secreted proteins 
; FILE REFERENCE: PZ002P1 

; CURRENT APPLICATION NUMBER: US/09/149, 47 6 

; CURRENT FILING DATE: 1998-09-08 

; EARLIER APPLICATION NUMBER: PCT/US98/ 044 93 

; EARLIER FILING DATE: 1998-03-06 

; EARLIER APPLICATION NUMBER: 60/040,162 

; EARLIER FILING DATE: 1997-03-07 

; EARLIER APPLICATION NUMBER: 60/040,333 

; EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/038,621 
; EARLIER FILING DATE: 1997-03-07 

EARLIER APPLICATION NUMBER: 60/040,626 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,334 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,336 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/040,163 
; EARLIER FILING DATE: 1997-03-07 
; EARLIER APPLICATION NUMBER: 60/047,600 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,615 

EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,597 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,502 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,633 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,583 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,617 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,618 
; EARLIER FILING DATE: 1997-05-23 
; EARLIER APPLICATION NUMBER: 60/047,503 
; EARLIER FILING DATE: 1997-05-23 



EARLIER APPLICATION NUMBER: 60/047,592 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,581 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,584 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,500 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,587 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,492 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,598 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,613 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,582 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,596 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,612 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,632 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,601 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/043,580 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,568 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,314 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,569 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,311 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,671 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,674 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,669 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,312 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,313 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,672 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/043,315 
EARLIER FILING DATE: 1997-04-11 
EARLIER APPLICATION NUMBER: 60/048,974 
EARLIER FILING DATE: 1997-06-06 
EARLIER APPLICATION NUMBER: 60/056,886 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,877 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,889 



EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,893 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,630 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,878 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,662 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,872 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,882 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,637 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,903 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,888 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,879 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,880 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,894 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,911 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,636 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,874 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,910 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,864 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,631 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,845 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/056,892 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/057,761 
EARLIER FILING DATE: 1997-08-22 
EARLIER APPLICATION NUMBER: 60/047,595 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,599 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,588 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,585 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,586 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,590 
EARLIER FILING DATE: 1997-05-23 
EARLIER APPLICATION NUMBER: 60/047,594 
EARLIER FILING DATE: 1997-05-23 



; EARLIER APPLICATION NUMBER: 60/047,589 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,593 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/047,614 

; EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,578 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/043,576 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/047,501 

EARLIER FILING DATE: 1997-05-23 

; EARLIER APPLICATION NUMBER: 60/043,670 

; EARLIER FILING DATE: 1997-04-11 

; EARLIER APPLICATION NUMBER: 60/056,632 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,664 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,876 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,881 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,909 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,875 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,862 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,887 

EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/056,908 

EARLIER FILING DATE: 1997-08-22 

EARLIER APPLICATION NUMBER: 60/048,964 

; EARLIER FILING DATE: 1997-06-06 

; EARLIER APPLICATION NUMBER: 60/057,650 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/056,884 

; EARLIER FILING DATE: 1997-08-22 

; EARLIER APPLICATION NUMBER: 60/057,669 

; EARLIER FILING DATE: 1997-09-05 

; EARLIER APPLICATION NUMBER: 60/049,610 

; EARLIER FILING DATE: 1997-06-13 

; EARLIER APPLICATION NUMBER: 60/061,060 

; EARLIER FILING DATE: 1997-10-02 

Query Match 4.8%; Score 285; DB 4; Length 92; 

Best Local Similarity 57.1%; Pred. No. 1.2e-10; 

Matches 52; Conservative 20; Mismatches 19; Indels 0; Gaps 0; 

Qy 107 9 NSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVP 1138 

I : I : I : I : I : i I I I I : I I I I I I I II I I : I II I I : I I I : I I I I I I : : I I I I 
Db 2 NAAMVIiINRALKLIIRLFLVEDLvT)SLKLAVFMWLMTYVGAVFNGITLLILAELLIFSVP 61 

Qy 1139 VI YERHQAQI DH YLGLANKNVKDAMAKI QAK 1169 

: : I I : : : I I t I I : I : I I : I I : I 

Db 62 IVYEKYKTQIDHYVGIARDQTKSIVEKIPSK 92 



RESULT 13 
US-08-973-462-8 

Sequence 8, Application US/08973462B 
Patent No. 6191270 
GENERAL INFORMATION: 
APPLICANT: DRUILHE, PIERRE 
APPLICANT: DAUBERS I ES, PIERRE 

TITLE OF INVENTION: MALARIAL PRE- ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 
FILE REFERENCE: 0660-0125-0 PCT 
CURRENT APPLICATION NUMBER: US/ 08 / 973 , 4 62B 
CURRENT FILING DATE: 1998-02-06 
EARLIER APPLICATION NUMBER: PCT/FR96/00 8 94 
EARLIER FILING DATE: 1996-06-12 
EARLIER APPLICATION NUMBER: FR 95/07007 
EARLIER FILING DATE: 1995-06-13 
NUMBER OF SEQ ID NOS : 2 9 
SOFTWARE: Patent In Ver. 2.0 
SEQ ID NO 8 
LENGTH: 17 8 6 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence : Polypeptide 
US-08-973-462-8 

Query Match 4.7%; Score 276.5; DB 3; Length 1786; 

Best Local Similarity 20.1%; Pred. No. 2.6e-08; 

Matches 232; Conservative 201; Mismatches 471; Indels 249; Gaps 50; 

Qy 30 EPEDEEEEEEEEEEDEDEDLEE L EVLE RK P AAGL S AAP VPT APAAGAP LMD FGN 83 

I I I : I I | | : : I : : i I : I I : : : : I I : : 
Db 22 8 EESVEENDEESVEENVEENVEENDDGSVASSVEES I AS SVDES IDS S I EENVAPTVE 284 

Qy 84 DFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPAR- 142 

: I I : | | | : | : : | III : I I 

Db 2 85 EIVAPS WESVAPSVEESVEENVEESVAENVEESVAENVEESVAENVEESVAEN 33 8 

Qy 143 PPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQ 198 

I | : | | : | II : : II : : : I 

Db 339 VEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESVEENVEESVAENVEESVAENVEES 398 

Qy 199 PGNTISAGQEDFPSVLLETAAS XPSLSPLSAASFKEHEYLGNLSTVLPT-EGTL 251 

: I : I : I | : : : | : : | : : I I : I : : 

Db 399 VAENV EESVAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESV 455 

Qy 252 QENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVI VANP 305 

: I I I I : I I : : : : : | I : I I I : I : I II 

Db 4 56 E ENVE E S VAEN VE E S VAEN VE E S VAENVE E S VAE - NVE E S VAENVEE S VAEN VE E S VAEN 514 

Qy 306 REEIIVKNKDE EEKLVSNNILHX QQELPTALTKLVKED EW 34 6 

Ml: :| |::|: ::: :::::: |:| I 

Db 515 VEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEESVEENVEESVAENVEESVAENVEES 57 4 

Qy 347 S S E KAK D S FN E K RVAVEAPMRE E YA DFKPFERVWEVKDSKEDSDMLAAGGKIESN 4 01 

: I : : I I : I I I I : I I I hi : I I 

Db 575 VAENVEESVAENVEEIVAPTVEEIVAPTVEEIVAPSWESVAPSVEES VEEN 626 



Qy 402 LESKVDKKCFADSLEQTNHE — KDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATE 459 

: I I |:::|:: | ::| : | : III: I |: I 

Db 627 VEESV AENVE E S VAENVE E S VAEN VE E I VAP T VE E I VAPT VE E I VA PSWE 677 

Qy 460 SIATNIFPLLEDPTSENXTDE — KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVT 517 

I : I I : I : I I : : : I I I : I I : II I I 

Db 67 8 SVA PSVEESVEENVEESVAENVEESVAENVEESVAENVEES VAENYE E I VAP T 730 

Qy 518 TDNLTKVT-EEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTS — EVMQ 573 

: : I I I : I I : : I : I I : I : I : I .= I : : I : I I = : 

Db 731 VEEIVAPTVEEIVAPSWESVAPS-VEESVEENVEESV7VENVE-ESVAENVEESVAENVE 788 

Qy 574 ESLYPAAQ — LCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSS PLEAS SVN 631 

I I : I :: I I I I I I I I I I : : : ::: : I : 

Db 789 ESVAPTVEEIVAPSVEESVA-PS VEESVAENVATNLSDNLLSNLLGGIETEEIK 841 

Qy 632 YESIKHEPENPPP YEEAMSVSLKV-SGIKEEIKE PENINAALQETEA 677 

: I I : I I . I I : I : I I I I I : I : I hi 

Db 842 -DSILNEIEEVKENWTTILENVEETTAESVTTFSNILEEIQENTITNDTIEEKLEELHE 900 

Qy 678 PYISIA CDLIKETKLSAEPAPDFSDYSEMA — KVEQPVPDHSELVED 722 

: I I I : I : I I II : I t I : : I : I : 

Db 901 NVLSAALENTQSEEEKKEVI DVIEEVK — EEVATTLIETVEQAEEKSANTITEIFENLEE 958 

Qy 723 SSPDS EPVDLFSDDS I PDVPQKQDETVMLVKES LTET 759 

: : : I I : : : : I I : I i I : III 

Db 959 NAVESNENVAENLEKLNETVFNTVLDKVEETVEISGESLENNEMDKAFFSEIFDNVKGIQ 1018 

Qy 760 SFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK 810 

I I : I : : : I I : : : I : : : | | : | | : : : : I 

Db 1019 ENLLTGMFRS I ETS I VIQS EEKVDL NENWSSILDNIENMKEGLL-NKLENISST 1072 

Qy 811 E KIPLQMEE LSTAVYSNDDLF 831 

I :| :: I :: :|:| 

Db 1073 EGVQETVTEHVEQNVYVDVDVPAMKDQFLGI LNEAGGLKEMFFNLEDVFKS E S DVI TVEE 1132 

Qy 832 ISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIA 886 

: II : :|| : : hi II I :: II 

Db 1133 IKDEPVQKEVE-KETVSIIEEMEENIVDVLEEEKEDLTDKMIDAVEESIEISSDSKEETE 1191 

Qy 887 NAP D GAG — SLPCTELPHD LSLKNIQPK-VEEKISFSDDFSKNGSATSKVL 934 

: I || I : : I I I h ::::::: I III:: 
Db 1192 SIKDKEKDVSLWEEVQDNDMDESVEKVLELKNMEEELMKDAVEIND ITSKLI 1244 

Qy 935 LLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTE KEDRSPSAI FSADLGKT 989 

: I : I : : : I I I I I I I : : hi I I 

Db 1245 EETQELNEVEAD LIKDMEKLKELEKALSEDSKEIIDAKDDTLEKVIEEEHDITT 1298 

Qy 990 SWDLLYWRDIKK 1002 

Db 12 99 TLDEWELKDVEE 1311 



RESULT 14 
US-09-345-473E-40 

; Sequence 40, Application US/09345473E 



Patent No. 6558903 
GENERAL INFORMATION: 
APPLICANT: Hodge, Martin 

TITLE OF INVENTION: No. 6558903el Kinases and Uses Thereof 
FILE REFERENCE: 35800/183781 

CURRENT APPLICATION NUMBER: US/ 09/34 5 , 47 3E 
CURRENT FILING DATE: 1999-06-30 
NUMBER OF SEQ ID NOS : 62 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 40 
LENGTH: 1601 
TYPE: PRT 

ORGANISM: C. elegans 
US-09-345-473E-40 

Query Match 4.5%; Score 267.5; DB 4; Length 1601; 

Best Local Similarity 20.5%; Pred. No. 8e-08; 

Matches 229; Conservative 134; Mismatches 376; Indels 377; Gaps 56 

Qy 29 REPEDEEEEEEEEE EDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGA 76 

I I : I 1 III llllhllhlll Mill 
Db 4 62 RAKEEEERIREEAEIKEELRLRAEAKEKEKERLEK-ERLEKKAAAAAAANPNPTP 515 

Qy 77 PLMDFGNDFVPPAPRGP LPAAPPVAPE RQPSWDPS-PVS — STVPAPSP 122 

: I I I I t II::: : I : I I I I : : : I I I I 

Db 516 IPPTPATPHSSAQQQPIPPPLSTQTSAEIQQSAQQPSVPVTMIANIPAMSP 566 

Qy 123 LSA AAVSPSK LPEDDEPPARPPPPPPASVSPQAEPVW— TPPAPAP — AAPP 170 

II : I I : : I : I I : I : I I I I I : I I I 

Db 567 TSAQPQPVLSPTSAAVPVPTTMIHVPKPSEIPVQNVATTAAPVAANNVPPSPAPFKTEDI 626 

Qy 171 STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAG-QEDFPSVLLETAASXPSLSPLSA 229 

II: II: I : : : I I I I I I : 

Db 627 QTPTLAQ NT VP RT I S T DAS GL VI NT PAS I AS P S PAP S 663 

Qy 230 ASFKEHEYLGNLSTVLP TEG TLQENVSEASKEVSEKAKTLLIDRDLT 276 

I : : : I I hi I I I I I I : : I : 

Db 664 AT DVASTTAPVTPAPTPTTTTDGGAAAASTTTENKEEKRKSNKRKWMEILGCDES 719 

Qy 277 E FSELEYSEMGSSFSVSP KAESAVIVANPREEIIVK 312 

I : I : I : I I : : | | | : 

Db 720 RNFALVSCRLDTSHKSWFQFAPGTDKPCTIATKLLAEDCLLKVHVHIVEAQLGEVIQLI 779 

Qy 313 NKDEEE KLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMRE 368 

It:: ||: : | | : I : : : I I I I : : : : 
Db 780 NSDGKKGVGTKLAT — VLDPNSTEPPTITAVMPKD SSAATASNTKPKIEI 827 

Qy 369 EYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSN 428 

I: :|: :: :::: I I I |: I I 

Db 828 EKTPPTRDASQE PNN VQVTNVRKVS QE S N 856 

Qy 429 DDTSFPSTPEGIKDRSGAYITCAPFN PAAT ES 460 

: I I I I I I : I I I I I I 

Db 857 AE-SVQSIP RPGGIIVMSPTNQTDSAPPPTGAAAKPSRFQVTKSADPIATPISSS 910 

Qy 461 IAT-NIFPLLEDPTSENXTDEKKIEEK-KAQIVTEKNTSTKTSNPFFVAAQDSETDYVTT 518 

I : I : I : : I I I I I : I I : : I I : I : : : I I 



Db 



911 ISTATVIPIVA-ATPTNITSEPVIVQPITAQVITHLATPSPVSHSL SSNSSPSATTH 966 



Qy 519 DNLTKVTEEWANMP-EGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLY 577 

I : : : : : : I I I : I I I : : I I 
Db 967 SNMSSI--QSTTSVPGRRFTVQPVSQA-ESGISSSISTPHPEPT 1007 

Qy 57 8 PAAQLCPSFEESEATPSPVLPDIVMEAPLNSAV PSAGASVIQPSSS PLEAS SVNY 632 

II II I I : I : I II I III : I I I : 

Db 1008 PAITSCP P PVP S VP P WSNGTLNLEVAP KQT P S ATNQNVDTQH S S STASTAT L 1060 

Qy 633 ESIKHEPEN PPPYEEAMSVS LKVSGIKEEI KEPENINAA 671 

: I | | : I : : MM: I : : 

Db 1061 — VSETPATVHVTPISVPAPVQEPLVIDHHSDVLTQLDSELRKVSGVSHS-ASPSTWES 1117 

Qy 672 LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 731 

I | M I : : I I : : : 
Db 1118 LT SMT PQTIP LACQTV- PAS I GQAP AVI AAAHA AS 1151 

Qy 732 LFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLS 791 

I : I M I : I II : Mill I : 

Db 1152 LIPNASVPQSPSRLD AETGLAGL HEKLEAL KME 1184 

Qy 7 92 LDNTKDTLLPDE VSTLSKKEKIPLQ MEELSTAVYSNDDLFI SKEAQIRET — 841 

I : I : I : M I : M I : II : : : : Ml 

Db 1185 QDRRED — MGDDAI GTTTTDGKDEIPIDTLKGLAEALGKVIHADG RETTP 1232 

Qy 842 ETFSDSSPIEII DEFPTLISSKTDSFSKLAREYTDLEVS — HKSEIANAP 889 

M M : M I I : I : II II I I : : I 

Db 1233 MP PDHPDLTDASTQQLI S P SNPDVLTTMS SAVEGSAS STMIEDI DASTSAVDASMMNSMP 1292 

Qy 890 DGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE 949 

II M : : : : I I III: : I I 

Db 1293 PGA QNSTDQIPAAMTLSMDQECAQSMTSSITR NTTGTKLAT 1333 

Qy 950 IESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSAD 985 

I : : III IIMII 

Db 1334 FENL ETALSSTLGTHIRQPNAPSSRD 1359 



RESULT 15 
US-08-978-277A-4 

; Sequence 4, Application US/08978277A 
; Patent No. 6582956 
; GENERAL INFORMATION: 

APPLICANT: Gelman, Irwin H. 

TITLE OF INVENTION: TUMOR SUPPRESSOR GENE 
NUMBER OF SEQUENCES: 20 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Brumbaugh, Graves, Donohue & Raymond 

STREET: 30 Rockefeller Plaza 

CITY: New York 

STATE : NY 

COUNTRY: USA 

ZIP: 10112-0228 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 



; OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ Version 1.5 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/978 , 211 A 
FILING DATE: 
CLASSIFICATION: 514 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/665,401 
FILING DATE: 18-JUN-1996 
ATTORNEY/AGENT INFORMATION: 
; NAME: Clark, Richard S 

; REGISTRATION NUMBER: 26,154 

REFERENCE/DOCKET NUMBER: A30558 - 165/34008 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-408-2558 
TELEFAX: 212-765-2519 
TELEX: 

INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS : 
; LENGTH: 1596 amino acids 

; TYPE: amino acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 
; FRAGMENT TYPE: internal 
; ORIGINAL SOURCE: 

US-08-978-277A-4 

Query Match 4.5%; Score 265; DB 4; Length 1596; 

Best Local Similarity 21.3%; Pred. No. l.le-07; 

Matches 232; Conservative 145; Mismatches 422; Indels 288; Gaps 47 

Qy 2 EDLD — QSPLVSSSDSPPRPQPAFKYQFVREPEDE EEEEEEEEEDEDEDLEELEV 54 

I : I : : I : I I : I I I I : I I | | : : | | : : | 

Db 620 EELEKVKSATLSSTDST VSEMQDEVKTVGEEQKPEEPKRRVDTSVSWEA 668 

Qy 55 L E RKP AAG L S AAP VP T AP AAGAP L MDFGNDFVPPAPR 91 

I : : I : : I I : I I II : : 

Db 669 LICVGSSKKRARKASSSDDEGGPRTLGGDSHRAEEASKDKEAGTDAVPASTQEQDQAQGS 72 8 

Qy 92 -GPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPAS 150 

I I I 1 I : I : I : I I III I I : 

Db 729 SSPEPAGSPSEGEGVSTWE SFKRLVTPRKK SKSKLEEKAEDSS — VEQLSTE 778 

Qy 151 VSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAG 2 06 

: I I I : I I : I I I : : : I 

Db 779 IEPSREESWV SIKKFIPGRRKKRA DGKQEQATVEDSGPVEI 819 

Qy 207 QEDFPSVLLETAASXPSLSPLSAASFKEHEYL GN LSTVLPTEGTLQENVSE 257 

I I I : I | : : | | | : I I : II I I : I I 
Db 820 NEDDPNV P AWP L S E YN AVE RE KMEAQ GNTELPQLL GAVYVS E E 863 

Qy 258 ASKEVSEKAKTLLID — RDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEI IVKNKD 315 

M : : t I I : I I I : : I : I : I ill: I : 

Db 864 LSKTLVHTVSVAVIDGTRAVTSVEERSPSWISASVT-EPLEHTAGEAMPPVEEVTEKDII 922 



Qy 316 EEEKLVSNNI LHXQQELPTALTKLVKE DEWSSEKAKDSFNEKRVAVEAPMREEY 370 

Hi II: : I |::|:|| I II I 

Db 923 AEETPV LTQTLPEGKDAHDDMVTSEVDFTS EAVTATET 960 

Qy 371 ADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDD 430 

I I I ::| :||::| :: : |: |: :: || | 

Db 961 SEALRTEEVTEASGAEETTDMVSAVSQLTDS PDTTEEATPVQEVESGVLD 1010 

QY 431 TS FPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDEK 481 

I : : : | : | | | : : : | | : 

Db 1011 TEEEERQTQAILQAVADKVKEES QVPATQTVQRTGSKALE KVE 1053 

Qy 482 KIEEKKAQIVTEKNTSTKTSNPFFVA AQDSETDYVTTDNLTKVTEEWANMPEGL 536 

: : I I : : I I | | | | | | | | Ml 
Db 1054 EVEEDSEVLASEKEKDVMPKGPVQEAGAEHLAQGSETGQAT PESL 1098 

Qy 537 TPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPV 596 

I: III : : ::: ::|:::: | : : :|| | 

Db 1099 EVPEVT-ADVDHVATCQVIKLQQLMEQAVAPESS--ETLTDSETNGSTP 1144 

Qy 597 LPD IVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMS 650 

II : ::l I |:| I II:: I I I I I 

Db 1145 LADSDT7VDGTQQDETIDSQDSKATAAVRQSQVTEEEAATAQKEEPSTLPNNVPAQE 1200 

Qy 651 VSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

I : I I I : I : : I :: : I :: I | : | | | | 

Db 1201 EHGEEPG — RDVLEPTQQELTAAAVPVLAKTEVGQEGEVDWLD-GEKVKEE 1248 

Qy 711 QPVPDHSELVEDSSPDSE-PVDLFSDDSIPDVPQKQD-ETVMLVKESLTETSFESMIEYE 768 

I I I I I : I : I : I : I I : I : : III : :| 

Db 1249 QEVFVH SGPNSQKAADVTYDSEVMGVAGCQEKESTEVQSLSLEEGEMETDVEKE 1302 

Qy 769 NKE-KLSALPPEG GKPYLESFKLSLDNTK DTLLPDEVSTL 807 

: I , : I I | | | | |:| | | | : 
Db 1303 KRETKPEQVSEEGEQET7VAPEHEGTYGKPVLTLDMPSSERGKALGSLGGSPSLPDQ 1358 

Qy 808 SKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDS--SPIEIIDE — FPTLISSK 863 

I I : I : : I I I : : : | | | : : | | | : | | | 

Db 1359 D KAGC I EVQ VQ S L DT T VT QT AEAV EKVIETWISETGESP-ECVGAHLLPAEKSSA 1413 

Qy 864 TDSFSKLAREYTDLEVSHKSEI7\NAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFS 920 

I I : : : I : I I : I | | : | : : : ) | | 

Db 1414 TGGHWTLQHAEDTVPLGPESQ AESI PI IVTPAPES — TLHPDLQGEISASQRE 1464 

QY 921 DDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKE7VEKKLP 968 

I I : I I I I I : : I : I | : : : | : 

Db 1465 RSEEEDKPDAGPDADGKESTAIEKVLKAEPEILELESKSNKIVLNVIQTAVDQFARTETA 1524 

Qy 969. SDTEKED 975 

: I I 

Db 1525 PETHAYD 1531 



Search completed: September 29, 2004, 18:21:04 
Job time : 33.2441 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



September 29, 2004, 18:05:43 ; Search time 28.8322 Seconds 

(without alignments) 
3930.111 Million cell updates/sec 

US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 117 8 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283366 



Database 



PIR_7 8 : * 
pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


1 


787.5 


13.3 


776 


2 


A46583 


2 


688 


11.6 


208 


2 


160904 


3 


671 


11.3 


267 


2 


A60021 


4 


528 


8.9 


2484 


2 


T26216 


5 


519.5 


8 . 8 


2607 


2 


T26215 


6 


340.5 


5.7 


222 


2 


T26213 


7 


331 


5.6 


7962 


2 


138346 


8 


325.5 


5.5 


5327 


2 


T13564 


9 


324.5 


5.5 


865 


2 


A47282 


10 


320.5 


5.4 


873 


2 


A47283 


11 


299.5 


5.1 


3488 


2 


T34418 


12 


299 


5.0 


3924 


2 


S37431 


13 


290.5 


4.9 


1274 


2 


T16251 



Description 



neuroendocrine- spe 
neuroendocrine- spe 
tropomyosin- relate 
hypothetical prote 
hypothetical prote 
hypothetical prote 
elastic titin - hu 
microtubule-associ 
calcium-binding pr 
calphotin - fruit 
hypothetical prote 
ankyrin 2, neurona 
hypothetical prote 



14 


289 


4. 


9 


2364 


2 


A56577 


microtubule-associ 


15 


289 


4. 


9 


2464 


1 


QRMSPl 


microtubule-associ 


16 


287.5 


4. 


9 


971 


2 


T19431 


hypothetical prote 


17 


285.5 


4. 


8 


1621 


2 


A82255 


hypothetical prote 


18 


278 


4. 


7 


1948 


2 


S00485 


gene 11-1 protein 


19 


274 


4. 


6 


3507 


2 


T34513 


hypothetical prote 


20 


272.5 


4. 


6 


1829 


2 


T24583 


hypothetical prote 


21 


272.5 


4. 


6 


2361 


2 


T25752 


hypothetical prote 


22 


271 


4. 


6 


2187 


2 


T30826 


nascent polypeptid 


23 


270 


4. 


6 


1230 


2 


T22458 


hypothetical prote 


24 


267.5 


4. 


5 


1851 


2 


T19964 


hypothetical prote 


25 


263.5 


4. 


4 


2409 


1 


A60979 


versican precursor 


26 


262.5 


4. 


4 


6642 


2 


T29757 


protein UNC-8 9 - C 


27 


261 


4. 


4 


1558 


2 


B71603 


RESA-H3 antigen PF 


28 


260.5 


4. 


4 


3381 


2 


T42389 


versican precursor 


29 


259.5 


4. 


4 


1828 


2 


A40115 


microtubule-associ 


30 


258 


4. 


4 


1616 


2 


G64242 


cytadherence-acces 


31 


257.5 


4 . 


3 


5170 


2 


T15348 


hypothetical prote 


32 


256 


4. 


3 


990 


2 


151618 


nucleolar phosphop 


33 


256 


4. 


3 


1634 


2 


T26517 


hypothetical prote 


34 


255.5 


4. 


3 


1684 


2 


JW0057 


gravin - human 


35 


253 


4. 


3 


1189 


2 


S56852 


hypothetical prote 


36 


252.5 


4. 


3 


1224 


2 


T14007 


microtubule-associ 


37 


252 


4. 


3 


4377 


2 


A55575 


ankyrin 3, long sp 


38 


250.5 


4. 


2 


1824 


1 


QRHUMT 


microtubule-associ 


39 


250.5 


4. 


2 


1830 


2 


A37981 


microtubule-associ 


40 


248 


4 . 


2 


1825 


2 


S13507 


microtubule-associ 


41 


248 


4 . 


2 


1890 


2 


T04556 


hypothetical prote 


42 


248 


4 . 


2 


3421 


1 


WZBEB6 


367K tegument prot 


43 


246 


4. 


2 


1110 


2 


151116 


NF-180 - sea lampr 


44 


244.5 


4 . 


1 


1029 


2 


T30351 


mucin-like protein 


45 


243 


4. 


1 


1320 


2 


JC5630 


TCOF1 protein - mo 



ALIGNMENTS 



RESULT 1 
A46583 

neuroendocrine-specif ic protein, splice form A - human 

N; Contains: neuroendocrine-specif ic protein, splice form B 

C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: A46583; 160903 

R;Roebroek, A. J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaeker 
F.C.; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A;Reference number: A46583; MUID : 93293865 ; PMID:7685762 
A; Accession: A4658 3 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-776 <R0E1> 

A;Cross-references: GB:L10333; NID : g307306 ; PIDN : AAA59950 . 1 ; PID:g307307 
A;Accession: 160903 



A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: mRNA 

A; Residues: 421-776 <R0E2> 

A;Cross-references: GB:L10334; NID:g307308; PIDN :AAA59951 . 1 ; PID:g307309 

C; Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A;Cross-references: GDB:203968; OMIM:600865 
A; Map position: 14q21-14q22 

Query Match 13-3%; Score 787.5; DB 2; Length 776; 

Best Local Similarity 31.9%; Pred. No. 3.1e-25; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25; 

Qy 588 ESEATPSPVLPDI — VMEAPLNSA VPSAGASVIQPSSSPLE ASSVNY 632 

I MINI:: II: I : I : : I I : I : I 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMT PAESTEVNKILADPLDQMKAEAYKYIDIT 200 

Qy 633 — ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

I :| I: :: I I: :|:| |::| I I : : : I 

Db 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDISIKPEGVREPDK-PAPVEGKIIKDHLLEEST 259 

Qy 67 6 EAPYISIACDL IKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVED 722 

I I I I II :l I II: = :: |: :: 

Db 2 60 FAPYID DLSEEQRRAPQITTPVKITLTEIEPSVE TTTQEKTPEKQDICLK 309 

Qy 723 SSPDSEPVDLFS DDS IPDVPQKQ DET VMLVKE S LTET S F 761 

I 1 I : I I III I : I | | : : | | : 
Db 310 PSPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 



Qy 762 ESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELS 821 

: | | | : : I III I I : : : 

Db 364 -KGLSYETAENPRPVGQLADRP EVKARSGPPTIPSPLDHEA 403 



Qy 822 TAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFP TLISSKTDSFS 868 

: : I I I I I : : I I : I II 

Db 404 SSAESGD SEIELVSEDPMAAEDALPSGYVSFGHVGGPPPSP 444 



Qy 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD 921 

: I I : I : : I : : I : I I I III 

Db 445 ASPSIQYSILREEREAELDSELIIESCDASSAS EESPKREQDSPPMKPSALD 496 

Qy 922 DF SKNGSATSKVLL LPPDVSALGHTQAEIESIVKP 956 

I : I I I III II I I : : I 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

Qy 957 KVLEKEAEKKLPSDTEKEDRSPSA 1 FSADLGKTS WDLLYWRDIK 1001 

: I : : : I I : I : I II : II I I I I I I I 
Db 552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIK 599 

Qy 1002 KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1061 

: | | : M | : I II III I I : I I I I I : I I I II I II I I I I I I : I I : I I : I I I I I I : I 
Db 600 QTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKA 659 

Qy 1062 YLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALF 1121 

Ml | : : I : I : I I I : : : II I : I I I I I I I I I I I M I I I I I I I I I : I I I I I I I 

Db 660 YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGALF 719 



Qy 1122 NGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I II I I : : I : : I : I : : I I : I : I I I I I I I I I I : : : I I I I I I I I I I I I I 
Db 720 N G L T L L LMAWSM FT L P WYVKHQ AQ I DQ YL GLVRT H I NAWAK I Q AK I P GAK RHAE 776 



RESULT 2 
160904 

neuroendocrine-specif ic protein C - human 
C; Species: Homo sapiens (man) 

C;Date: 24-May-1996 #sequence_revision 24-May-1996 #text_change 05-Nov-1999 
C;Accession: 160904 

R;Roebroek, A. J.; van de Velde, H.J.; Van Bokhoven, A.; Broers, J.L.; Ramaekers, 
F.C.; Van de Ven, W.J. 

J. Biol. Chem. 268, 13439-13447, 1993 

A; Title: Cloning and expression of alternative transcripts of a novel 
neuroendocrine-specif ic gene and identification of its 135-kDa translational 
product . 

A;Reference number: A46583; MUID : 932 93865 ; PMID: 7685762 
A; Accession: 160904 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-208 <RES> 

A;Cross-references: GB:L10335; NID:g307310; PIDN: AAA59952 . 1; PID:g307311 

C; Genetics : 

A; Gene: GDB : RTN1 ; NSP 

A;Cross-references : GDB: 203968; OMIM: 600865 
A;Map position: 14q21-14q22 

Query Match 11.6%; Score 68 8; DB 2; Length 2 08; 

Best Local Similarity 67.5%; Pred. No. 6.2e-22; 

Matches 129; Conservative 32; Mismatches 30; Indels 0; Gaps 0; 

Qy 988 KT S WDLLYWRD I KKTGVVFGAS LFLLLS LTVFS I VSVTAYI ALALL SVT I S FRI YKGVI 1047 

I : : I II I I II I I : I I : I II : I II III I I : I I I II : I I I II I I I I I I I I I : 
Db 18 KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 77 

Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I : I I : I I I I I I : I I I I I : : I : I : I I I : : : M I : I I I I I I I I I 1 I I I I II I 

Db 78 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKF 137 

Qy 1108 AVLMWFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

Mill: I I I I I I II I I I I I : : I : : I : I : : I I : I : I I I I II I I I I : : : I I I I 
Db 138 AVLMWLLTYVGALFNGLTLLLMAWSMFTLPVVYVKHQAQIDQYLGLVRTHINAWAKIQ 197 



Qy 1168 AKI PGLKRKAE 1178 

I I I I I II II 
Db 198 AKIPGAKRHAE 208 



RESULT 3 
A60021 

tropomyosin-related protein, neuronal - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1993 #sequence_revision 03-Mar-1993 #text_change 05-Nov-1999 

C; Accession: A60021 

R;Wieczorek, D.F.; Hughes, S.R. 

Brain Res. Mol . Brain Res. 10, 33-41, 1991 



A; Title: Development ally regulated cDNA expressed exclusively in neural tissue. 
A; Reference number: A60021; MUID : 91278684 ; PMID: 1647480 
A; Accession: A60021 
A;Molecule type: mRNA 
A; Residues: 1-267 <WIE> 

A;Cross-references: EMBL : X52817 ; NID:g456549; PIDN : CAA37001 . 1 ; PID:g456550 
C; Comment: This neuronal-specif ic mRNA was identified by hybridization to an 
alpha-tropomyosin probe but does not show homology in amino acid sequence. 

Query Match 11.3%; Score 671; DB 2; Length 267; 

Best Local Similarity 66.3%; Pred. No. 4.4e-21; 

Matches 124; Conservative 34; Mismatches 29; Indels 0; Gaps 0; 



Qy 


988 


KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 

I : : 1 1 1 1 1 1 1 1 1 : 1 1 : 1 1 1 : 1 II III 1 1 : 1 1 t 1 1 : 1 1 I II 1 1 1 1 1 1 1 1 1 : 
KSQAIDLLYWRDIKQTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVL 


1047 


Db 


9 


68 


Qy 


1048 


QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 

11:11:111111:1111 1 : : 1 : I : 1 1 1 : : : 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 


1107 


Db 


69 


128 


Qy 


1108 


AVLMWFTWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 

Mill: 1 1 II 1 1 1 1 1 1 1 1 1 : : 1 : : 1 : I : : 1 1 : 1 : 1 1 1 1 : 1 MM : : : 1 1 1 1 
AVLMWLLTWGALFNGLTLLLMAWSMFTLPWYVKHQAQVDQYLGLVRTHINTWAKIQ 


1167 


Db 


129 


188 



Qy 1168 AKIPGLK 1174 

I I I I I : 
Db 189 AKIPGAR 195 



RESULT 4 
T26216 

hypothetical protein W06A7.3c - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Accession: T2 6216 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A;Reference number: Z20173 
A;Accession: T26216 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-2484 <WIL> 

A/Cross-references: EMBL:Z78066; PIDN : CAB514 67 . 1 ; GSPDB: GN00023 ; CESP : W06A7 . 3c 

A; Experimental source: clone W06A7 

C; Genetics: 

A; Gene: CESP : W06A7 . 3c 

A;Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2339/2; 2463/2 

Query Match 8.9%; Score 528; DB 2; Length 2484; 

Best Local Similarity 20.9%; Pred. No. 7.4e-14; 

Matches 277; Conservative 186; Mismatches 461; Indels 402; Gaps 54; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEE EE 38 

: I : : I MM I I : I I : : I 

Db 1397 LEKVEVQP — DLSQNSPAPHKIIDLHF-NI PKDHEDYGNDYVPFGTESSEESQKADGNQE 1453 



Qy 39 EEEEED EDEDLEELEVLERKPA AGLSAAP 67 

:|||| |||: I: |: I : I: 

Db 1454 NQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDVDASDVNEQDEESTLKIL 1513 

Qy 68 — VP T AP AAGAP LMD FGN D FVP PAP RGP L P AAP P VAP — ERQPSW 108 

I I : I : : I I I I I hi I I I 

Db 1514 KWP SEPSLLE--LDFTND — P KVT H VP I P LME P ATM Y L E EMVEW 1 1 ADAVK E VS EME W 1569 

Qy 109 DPSPVSSTVPAPSPLSAAAVSPSKLP-EDDEPPARPPPPPPASVSPQAEPVW 159 

I II I I II : I I I I I I I I I I I : I : 

Db 1570 TESEISEMAPQVSESTCPIPEPL- ADLKLPVEDDEKTPEPEPWPGQVQERI IPIE 1624 

Qy 160 TPPAPA- PAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVL — LE 216 

I I I I I I : : I I I : 

Db 1625 VEQAPTIPQRPPRAP KSELPKVAKPLD 1651 

Qy 217 TAASXPSLSPLSA ASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLL 27 0 

: | :| |: : :: I : :| Mil: : :::: I : I 

Db 1652 DSKSRVRFAPLNIKLGRTYSEEQQKELVESLERPL-TIITQQKPPEKPTEDIG — ALSPL 17 08 

Qy 271 I DRDLTEFS ELEYSEMGS S FS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQ 330 

| | : I : : I I : : I : I I I I : : I I 
Db 17 09 SPNTLAEYEEVPMMDMQS VPHSPQE KQEEIEALSEII 1745 

Qy 331 ELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK PFERVWEVKDSKE 387 

| I I : : : I I : I I : I : : I : I : I | : : I I : 

Db 1746 EEPQAMKEVEKPVE- SAPEKDNESLEAPEI - INEP I RRVLVETKIMGPGKSLNEDNDDDD 1803 

Qy 388 D-SDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESS NDDTSFPSTPEGIKD 442 

I I : I : I : I : : I I : : : : I I I I I I : I I : 

Db 1804 DGSECLDSIGDLS^ ERTIQR — FNT S I DDPS I RRDS FS S I S S FGDRQKFRTAI ENI RQ 1858 

Qy 443 RSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSN 502 

II :::::: I : :|| 

Db 1859 DLLPFQSSVSQYLRSSPNP SQQLLVTN 1885 

Qy 503 PFFVAAQDSETDYWTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETK 562 

: I I : I : I III | | : : : I : : I : : I 

Db 1886 LSMDSPSD — LSPNAPPVGFENTAQFLEKLQQE-DRPSAEGSIDSSGFEKVDHE — 1936 

Qy 563 MDLVQT S EVMQE S L YP AAQ LC P S FEE S EAT P S P VL P D I VMEAP LN S AVP SAGAS VI Q P S S 622 

: I I I I : I I I : : : I I 

Db 1937 GLDEFAAPP VHDPMQKSVFGSLGSDDMKPGS 1967 

Qy 623 SP LEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENIN AAL 672 

: I : I | : : | : | : : | I III 

Db 1968 QDDGFVFIERNEAN EATLKKNQKMS SHHNDVI EKNYFNDNAPTAAL 2 013 

Qy 673 QETEAPYISIACDLIKETKLSAEPAPDFSDYSEMA KVEQPVPDHSEL 719 

I : I II::: II 1 = 1 = I Ml: I 

Db 2014 — LESPIAEEARKLVQDAVESA S EYKKQAVDS GDEI GRELLDNVEQKIEQVKEP 2065 

Qy 720 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

: I I : I I II Ml :: I I : I I M 
Db 2066 I VDS L- -HKAYDGVGDFVHETVPNAVDDFVREAEKQLPES PVPEKI E TPE 2113 



Qy 


/ o U 


rrmvT T-i c rpT/T ct pixtt vr>T t — T PTiTta/'C.TT QK'PfFK'T PT.nMFFT.^TAVYSMnDT.FT ^KF AO 

1 : II: : III :: = 1 = 

DT \mTunT\mTr\7"WriT?A/TiMFT RRFPTP- PFF.TDDVAPLSDDKPOFGNOTP 


837 


Db 


O 1 1 /I 

Z1X4 


2161 


Qy 




TTDTrrnTrrpTcric CDTI7TT n^TTDTT T c vrp r\ a -pc V'T. AR F YTnT.F.VSHK^ FT AN A PDG AGS T, PC 


897 


DD 


z X oZ 


: :l : :: 1 I: III: 


2215 


Qy 


0 y 0 


TELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPK 
: : I 1 : : | | : | : : : | 


957 


Db 


2216 


TTDPT rpPPTTM FPW PiP«rFFT\/"PT?PP 


99^7 


Qy 


958 


VLEKEAEKKLPSDTEKEDRSPSAIFSADLG KT — 

II II II : HI II 

X rirKDXoUrji^Vi\LTO 1 V I\ILbronnnunrij o rnnOl 1j rvriri ^ Ur\Yi x uc rvi v r 


QQQ 

J o z> 


Db 


22 38 


2286 


Qy 




-SWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQ 
1 : 1 : : 1 1 1 1 1 1 : : 1 : 1 : 1 1 : : : : 1 1 1 1 1 1 : : 1 1 

PCVLD V I YWRDAKKSAX VLoXiAXiXiVLr VXlAA i FXjXiI V V 1 i oijijlj/\ij^/\/\/\^r i\v r j\j\V£i/i 


1048 


Db 


22 87 


£. O *± \J 


Qy 


1049 


AIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFA 
| : | : I Ml | : : : : | | : : : 1 1 : 1 : : 1 1 : : : 1 : M 

rs-r wTnQFMDFQFTT AnnT TT PDFK\/H An ADVFVFHATCT ANKTiKKLVFVF.S PLES IKFG 


110ft 

X J. \J O 


Db 


Z j4 / 


2406 


Qy 


lino 

xxuy 


\n TvfTAA/'Tr'T'VA/'P zxt FMPT TT T TT AT T*=1T F^VPVT YFRHOAOT DHYI.GLANKNVKDAMAKI OA 

: : : | I I : : 1 : 1 i 1 1 1 1 : : 1 1 1 1 : 1 1 : I 1 1 : 1 : : : 1 : 1 
LVLWSLTYI ASWFSGFTLAI LGLLGVFS VPKVYESNQEAI DPHLAT I S GHLKNVQNI I DE 


1168 


Db 


2407 


2466 


Qy 


1169 


KIPGLK 1174 




Db 


2467 


1 : 1 1 : 

KLPFLR 2472 





RESULT 5 
T26215 

hypothetical protein W06A7.3a - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26215 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A;Accession: T26215 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-2607 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN : CAB01522 . 2 ; GSPDB : GN00023 ; CESP : W06A7 . 3a 

A; Experimental source: clone W06A7 

C; Genetics : 

A; Gene: CESP : W06A7 . 3a 

A; Map position: 5 

A;Introns: 1827/1; 1866/3; 1963/3; 1990/3; 2262/1; 2289/1; 2412/1; 2462/2; 
2586/2 

Query Match 8.8%; Score 519.5; DB 2; Length 2607; 

Best Local Similarity 20.4%; Pred. No. 1.8e-13; 

Matches 292; Conservative 175; Mismatches 474; Indels 491; Gaps 55; 



Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEE EE 38 

: | : : I I : I I I I : I I : : I 

Db 1397 LEKVEVQP--DLSQNSPAPHKIIDLHF-NIPKDHEDYGNDYVPFGTESSEESQKADGNQE 1453 

Qy 39 EEEEED EDEDLEELEVLERKPA AGLSAAP ~ 67 

: IN |||:|:|: I : |: 

Db 1454 NQEEEDWAELNFHPIRQWRDEDVISLQSLKSLVAEVGCITDVDASDVNEQDEESTLKIL 1513 

Qy 68 — VP TAPAAGAP LMDFGNDFVP PAP RG P L PAAP P VAP — ERQPSW 108 

I I : I : : I I 1 I I hi I I I 

Db 1514 KWPSEPSLLE — LDFTND — P KVI H VP I P LME P ATMYLE EMVEWI I ADAVKE VS EME W 1569 

Qy 109 DPSPVSSTVPAPSPLSAAAVSPSKLP-EDDEPPARPPPPPPASVSPQAEPVW 159 

I 11 I I I I : I I I I I I I I I I I : I : 

Db 1570 TESEISEMAPQVSESTCPIPEPL ADLKLPVEDDEKTPEPEPWPGQVQERIIPIE 1624 

Qy 160 TPPAPA-PAAPPSTP AAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQE 208 

I I I I I I M I : I : I I I I : 

Db 1625 VEQAPTIPQRPPRAPKSELPKVAKPLDDSKS RVRFAPLNIKLGRTYSEEQQKEL 1678 

Qy 209 DFPSVLLETAASXP SLSPLSAASFKEHEY — LGNLSTVLPTEGTLQEN 254 

: I :: I I : I I I 1 I : I : I ::::(: II 

Db 1679 VESLERPLTII-TQQKPPEKPTEDIGALSPLSPNTLAEYEEVPMMDMQSVPHSPQEKQEE 1737 

Qy 255 VSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK — 312 

: | | - | : | : : : I : I : : I : I : I : : I : 

Db 1738 I-EALSEIIEEPQAM KEVEKPVESAPEKDNESLEAPEIINEPIRRVLVETK 1787 

Qy 313 NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEA 364 

I : I : : : I : I I I : II : : : 

Db 1788 IMGPGKSLNEDNDDDDDGSECLDSIGDL SERTIQRFN TSIDD 1829 

Qy 365 P — MREEYADFKPFERVWEVKDSKED — SDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

I | : : : I : : : I : I : I : I I : : I M 

Db 1830 PSIRRDSFSSISSFGDRQKFRTAI ENIRQDLLPFQSSVSQYLRSSPNP SQQLLVTNL 1886 

Qy 421 EKDSESSNDDTSFPSTPEGI KDRSGA — YITCAPFNPAATESIATNIF 466 

III 11:11 : I I I 1:1 I : 

Db 1887 SMDSPS DLSPNAPPVGFENTAQFLEKLQQEDRPSAEGSIDSSGFEKVDHEGLDEFAA 1943 

Qy 467 PLLEDPTSEN XTDEKK IEEKKAQIVTEKNTSTKTSNPFFVAAQDS 511 

1 : I I : : : I : I I I : I II : 1 : I = : 

Db 1944 PPVHDPMQKSVFGSLGSDDMKPGSQDDGFVFIERNEANEATLKKNQKMSSHHNDVIEKNY 2003 

Qy 512 ETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV-TGTKIAYE TKMDLV 566 

I I : I : I I I I : I I : I : I : I I : : 

Db 2004 FNDNAPT AALLES PI AEEARKLVQDAVESASEYKKQAVDS GDEI GRELLDNVEQKI E 2060 

Qy 567 QTSEVMQESLYPAAQLCPSFEESEAT PS PVLPDI VMEAPLNSAVP SAGAS VIQPSSSPLE 626 

I I : : I I : I I I I : I I I I I 
Db 2061 QVKEPIVDSLHKAYDGVGDFVH-ETVPNAV-DDFVREAE 2097 

Qy 627 ASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDL 686 

I I I : I I I : I : I I I : I : 

Db 2098 KQLPESPVP EKIETPE P LVD I HDTV 2122 



Qy 687 IK ETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDV 741 

I : I II I I :| :| I: I : : 

Db 2123 DKVHDEVDNFLRREPTPPFE TDDVAPLSDDKPQFGNQT — 2160 

Qy 742 PQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLP 801 

:: I I I I I I : I 

Db 2161 -PEEDETTFDRKGPLT IP 2177 

Qy 802 DEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLIS 861 

: I I I : I : I I : I : I I : : 

Db 2178 EEV EKAAAAQNND LDDFDPLVT 2199 

Qy 862 SKT DSFSKLAR EYTDLEVSHKS EI ANAPD 890 

|| I : I : : I : I I : I 

Db 2200 SNTGAAFGAAVG7\AAAVESLTEEEMFGHQKFETVPRPPTPPKDISDEDVK-PSTVNLGPS 2258 

Qy 891 GAGSLPCTELPHD LSLKNIQPKVEEKIS FSDDFS 924 

||:ll : | : | : I I : M 

Db 2259 HHHSHPSS— PHHSILKHHGDAWIDFKTVPPCAQNAFSPGEIMFLLAFFVYLSCFASFFS 2316 

Qy 925 KNGSATSKVLLL PPDVSALG 94 4 

I: :i I :l I I 

Db 2317 KSLPLLDNLLSLWYLSISLIIHVKHHRKFRWNEEQATTMSKLGAVGRGLYALIAFIVNI 2376 

Qy 945 — HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKK 1002 

: : I | Ml I I I I I : I : : I I M I I 

Db 2377 VL RVGLN VALVVGVAVS AH EAYKLT K S S GVLRKKEVLDVI YWRDAKK 2423 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

: : | : I : I I : : : : I I I I I I : : I I I : I : I I I I 

Db 2424 SAIVLSIJ^LVLFVI^YPLLTVVTYSLLLALGAAAGFRVFKKVEAQIKKTDSEHPFSEI 2483 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

| : : : : | | : : : I I : I : : I I : : : I : I I : : : I I I : : I : 
Db 2484 LAQDLTLPQEKVHAQADVFVEHATCIANKLKKLVFVESPLESIKFGLVLWSLTYIASWFS 2543 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 1174 

I I I I I I :: I I I I : I I : I I I : I : : : I : I I : I I : 

Db 2544 GFTLAILGLLGVFSVPKVYESNQEAIDPHLATISGHLKNVQNIIDEKLPFLR 2595 



RESULT 6 
T26213 

hypothetical protein W06A7.3b - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C;Accession: T26213 
R;Ainscough, R. 

submitted to the EMBL Data Library, August 1996 
A; Reference number: Z20173 
A;Accession: T26213 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-222 <WIL> 

A;Cross-references: EMBL:Z78066; PIDN : CAB01523 . 1 ; GSPDB: GN00023; CESP : W06A7 . 3b 
A; Experimental source: clone W06A7 
C; Genetics : 



A; Gene : CESP : WO 6A7 . 3b 
A; Map position: 5 
A;Introns: 27/1; 77/2; 201/2 



Query Match 5.7%; Score 340.5; DB 2; Length 222; 

Best Local Similarity 32.4%; Pred. No. 1.4e-07; 

Matches 66; Conservative 52; Mismatches 83; Indels 3; Gaps 2; 

Qy 972 EKEDRSPSAIFSADL-GKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIA 1030 

: I : | : : : | | | : : I : : I I I I I I : : I : I : I I : : : : I I 
Db 9 KKYSKQPTWPATDFPGK--ILDVIYWRDAKKSAIVLSLALLVLFVIAKYPLLTvArrYSL 66 

Qy 1031 LALLSVTISFRIYKGVT QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIK 1090 

I I I t : : I I I : I : I I II I : : : : I I : : : I i 

Db 67 LLALGAAAGFRVFKKV^QIKKTDSEHPFSEIIAQDLTLPQEKVmQADVFVEHATCIAN 12 6 

Qy 1091 ELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDH 1150 

: | : : | | : : : I : I I : : : I I I : : I : I I I M I : : I I I I : I I : I II 
Db 127 KLKKLVFVESPLESIKFGLVLWSLTYIASWFSGFTLAILGLLGVFSVPKVYESNQEAIDP 186 

Qy 1151 YLGLANKNVKDAMAKIQAKIPGLK 1174 

: I : : : i : I hi h 
Db 187 HLATISGHLKNVQNIIDEKLPFLR 210 



RESULT 7 
138346 

elastic titin - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text_change 21-Jul-2000 
C;Accession: 138346 
R;Labeit, S.; Kolmerer, B. 
Science 270, 293-296, 1995 

A; Title: Titins : giant proteins in charge of muscle ultrastructure and 
elasticity. 

A;Reference number: A57430; MUID : 96026330 ; PMID: 7569978 
A;Accession: 138346 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-7962 <RES> 

A;Cross-references: EMBL:X90569; NID : gl017426 ; PIDN : CAA62189 . 1; PID:gl017427 

C; Genetics : 

A; Gene: GDB : TTN 

A;Cross-references : GDB: 127867; OMIM: 188840 
A;Map position: 2q31-2q31 

Query Match 5.6%; Score 331; DB 2; Length 7962; 

Best Local Similarity 20.0%; Pred. No. 4.7e-05; 

Matches 235; Conservative 144; Mismatches 404; Indels 392; Gaps 48; 

Qy 32 EDEEEEEEEEEEDEDEDLEELE — VLERK PAAGLSAAPV PTAPA 7 3 

I : I I I I I I I 11:111 I : I I : = I I I I I 

Db 6176 EEEREEEEEAEVTEYEVMEEPEEYWEEKLHIISKRVEAEPAEVTERQEKKIVLKPKIPA 6235 

Qy 74 AGAPLMDFGNDFVPPAPRGPLPAAP-PVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSK 132 

: I : I II : I I :: I I I I I I I I 

Db 6236 K IEEPPPAKVPEAPKKIVPEKK VPAPVP-KKEKVPPPK 6272 



133 LPEDDEPPA RPPP PPPASVS P QAE P VWT P PAPAPAAP P S - T PAA 175 

: M : : I : I I Mil: III I I = I 

6273 VPEEPKKPVPEKKVPPKVI KMEEPLPAKVTEKHMQITQEEPCVLVAVTKKEAPPKARVPEE 6332 

176 PKRRGSSGAWXXXXKIMDLK EQPGNTISAGQEDF PSVLLETAASXPS-LSPL 227 

Ml I : : M I : I : : : : I : I I : : 

6333 PKR AVPEEKVLKLKPKREEEPPAKVTEFRKRWKEEKVSIEAPKREPQPIKEV 6385 

228 SAASFKEHEYLGNLSTVLPTEGTLQEN-VSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 28 6 

: Ml Ihlll : I I : : : I I I I I : I 

6386 TIMEEKERAY TLEEEAVSVQREEEYEEYE EYDYKEFEEYEPTEE 6429 

287 GSSFS VSPKAESAVIV — ANPREEI 309 

: I I I I : : I I I : : 

643 0 YDQYEEYEEREYERYEEHEEYITEPEKPI PVKPVPEEPVPTKPKAPPAKVLKKAVPEEKV 64 8 9 

310 IV KNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAP 365 

I || I : :::::: | | : | | | : I I I I I I 

64 90 PVPIPKKLKPPPPKVPEEPKKVFEEKIHISITK — REKEQVTEPAAKVPMKPKRWAEEK 6547 

366 MREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSE 425 
: : I I I I I I : : | : : | : : : I : I I : : i 

6548 VPVPRKEVAPPVRVPEVPKELEPEEV AFEEEWTHVE-EYLVEEEEEYIHEEE-E 6600 

42 6 SSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDEKKIEE 485 
:: ||:: II I : I I :: I I I I 

6601 FITEEEWPVIPVKVPE VPRKPVPEEKKPVPVPKKKEAPPAKVPEVPKKPEE 6652 

4 86 KKAQIVTEKNTSTKTSNPFF VAAQDSETDYVTTDNL 521 

I :: : I I | | :: | : 

6653 KVPVLIPKKEKPPPAKVPEVPKKPVPEEKVPVPVPKKVEAPPAKVPEVPKKPVPEKKVPV 6712 

522 TKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEV 571 

III : I I I | : I : : I : =11 M 

6713 PAPKKVEAPPAKVPEVPKKLIPEEKKPTPVPKKVEAPPPKVPKKREPVPVPVALPQEEEV 6772 

572 M-QESLYPAAQLCPS FEES EAT P 593 

: : I : I : : I III I 
6773 LFEEEIVPEEEVLP — EEEEVLPEEEEVLPEEEEVLPEEEEIPPEEEEVPPEEEYVPEEE 683 0 

594 S P VL P D I VMEAP LN S AVP SAGAS VI Q PSSSPLEASSVNYESI- 635 

I I I : : : I : : I I I : hi i : I 

6831 EFVPEEEVLPEVKPKVPVPAPVPEI KKKVTEKKVVI PKKEEAP PAKVPEVPKKVEEKRI I 6890 

535 — KHE PENPPPYEEAMSVSLKVSGIKEEI KEPENIN 669 

II I I I I II I I : : I I I 

6891 LPKEEEVLPVEVTEEPEEEPISEEEIPEEPPSIEEV EEVAPPRVPEVIK 6939 

670 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 729 
| : | I I : I I I II : : I : I :: I 

6940 KAVPEAPTPV PKKVEAPP AKVS KK I P E E KVPVPVQ KKEAP P 6980 

730 VDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFK 7 89 
: I : I I : I I : M | | : | | | : 
6981 A KVPEVPKKVPEKKVLV PKKEAVPPAKGR 7009 



Qy 790 LSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSP 849 

|:| ::|| ::| : : II :: I M : I I : : 

Db 7010 TVLEEKVSVAFRQEVWKERLELEWEAEVEE— IPEEEEFHEVEEYFEEGE 7 059 

Qy 850 IEIIDEFPTLISSKTDSFSKLAREYTDLEV--SHKSEIANAPDGAGSLPCTELPHDLSLK 907 

: : I I | : : :: : : :|| : : |: I I : : I I 

Db 7060 FHEVEEFIKLEQHRVEEEHRVEKVHRVIEVFEAEEVEVFEKPKAPPKGP EISEK 7113 

Q y 908 NIQPK — VEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

IN MM::: Ml 
Db 7114 IIPPKKPPTKWPRKEPPAKVPEVPKKIWEEKVRVPEE PRVPP 7157 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSP 978 

: : : : II I I I I M I : M 

Db 7158 TKVPEVLPPK — EWPEKKVPVPPAKKPEAP 7186 



RESULT 8 
T13564 

microtubule-associated protein homolog - fruit fly (Drosophila melanogaster ) 
N;Alternate names: hypothetical protein EG:49E4.1 
C; Species: Drosophila melanogaster 

C;Date: 13-Aug-1999 #sequence_revision 13-Aug-1999 #text_change 17-Nov-2000 
C; Accession: T13564 

R;Spanos, L. ; Papagiannakis , G. ; Siden-Kiamos , I.; Louis, C. 
submitted to the EMBL Data Library, April 1999 

A; Description: Sequencing the distal X chromosome of Drosophila melanogaster. 
A; Reference number: Z17689 
A; Accession: T13564 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-5327 <SPA> 

A;Cross-references: EMBL : AL031128 ; PIDN : CAA20006 . 1 
C; Genetics : 

A;Cross-references : FlyBase: FBgn0025392 

A;Introns: 24/2; 52/3; 104/3; 179/1; 232/1; 1669/3; 2566/1; 4798/3; 5272/1 
A;Note: EG:49E4.1 

C; Super family: Drosophila 576K microtubule-associated protein homolog 

Query Match 5.5%; Score 325.5; DB 2; Length 5327; 

Best Local Similarity 23.6%; Pred. No. 4.6e-05; 

Matches 275; Conservative 151; Mismatches 520; Indels 219; Gaps 53; 

EPEDEEEEEEEEEE DEDEDLEELEVLERKPAAGLS AAPVPTAPAAG 7 5 

: : I : I I I : : I I I : | | : : : II I : 



Qy 


28 


Db 


1277 


Qy 


76 


Db 


1337 


Qy 


127 


Db 


1396 


Qy 


178 



| I : M : I I I II I : I I : M I I I 

KDREDTGSIESPPTIEEAI EVEVQAKQEAQKPVPAPEEAIKTEKSPLASK-ETSRPESAT 1395 

AVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPA APK 177 

: II I ||: : I : Ml I : I : I I 
GSVKEDTEQTKSKKSPVPSRPESEAKDKKSPFASGEASRPESVAESVKDEAGKAES 14 51 

RRGSSGAWXXXXKIMDLKEQPGNTISAGQE DFPSVLLETAASXP-SLSPLSA 22 9 

I I I : I II I : III M I I : : I 



Db 



1452 RRESIAKTHKDESSLDKAKEQESRRESLAESIKPESGIDEKSALASKEASRPESVTDKS- 1510 



Qy 230 ASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSS 289 

|| ::: | | | | : II I I : : I I I I : 

Db 1511 KEPSRRESIAESLKAESTKDEKSAPPSKEASRPGSWESVKDETEKSKEPSRRESIA 1567 

Qy 290 FSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNN 1 LHXQQELPTALTKLVKEDEW 346 

III | : | | : I III I I : : I :: : I I :: : 

Db 1568 ESAKPPIEFRE-VSRP-ESVIDGIKDESAKPESRRDSPLASKEASRPESVLESVKDEPIK 1625 

Qy 347 SSEKAK DSFNEKRVAVE-APMREEYADFKPFERVWEVKDS — KEDSDMLAAGGKI 398 

| : | | : : : I I I : I : : : I I I h III : I I : 

Db 1626 STEKSRRESVAESFKADSTKDEKSPLTSKDIS-RPESAVENVMDAPFKETSRPESAVGSM 1684 



Qy 399 ESNLESKVDKKCFADSLEQTNHE — KDSESSNDDTSFP-STPEGIKDRSGAYITCAPFNP 455 

: M I : I I I ::: I I I I I I I I I I 

Db 1685 KDESMSK EPS RRES VKDGAAQ S RET S RPAS VAESAKD — GADDLKELSRP 1732 



Qy 456 AATE SIATNIFPLLEDPTS ENXTD — EKKIEEKKAQIVTEKN--TSTK 499 

: | || I I : I I : I I I I I : : I I I : I : 

Db 1733 ESTTQSKEAGSIKDEKSPLASEEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKE 1792 

Qy 500 TSNPFFVAAQDSETDYVTTDNLTKVTEEWANMP EGLTPDLVQEACESELNEVTGT 555 

III:!' II I I I I I: : I 
Db 1793 ASRPASVA--ESIKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDE AE 1845 



Qy 556 KIAYETKMDLVQTSEVM — QESLYPAAQLCPS FEES EAT PSPVLPDIVME-APLNSAVPS 612 

I | : : : I : : I : I I : : I : I : : I I : I I I I 

Db 1846 KSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEESRRESVAEKSPLPSKEAS 1905 



Qy 613 AGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEI KEPENIN 669 

III: || : : | | : M : I I : I II 

Db 1906 RPASVAES I KDEAEKS KEES RRE S VAEKS PLP S KEAS RPAS VAES I KDEAEKS KEE S RRE 1965 



Qy 670 AALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEP 72 9 

: ::: I II I I I :| :| I I I : I I 
Db 1966 SVAEKSPLP S KEAS RPASVAES I KDEAEKS KEE S RRES VAEKS P — 2009 

Qy 730 VDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIE — YENKEKLSALP-PEGGKP — Y 784 

: I I : | | | : : : : | | | : : I I I I : I 

Db 2010 LPS KEAS RPASVAES I KDEAEKSKEESRRESVAEKS PLP SKEAS RPASV 2058 

Qy 785 LESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEE LSTAVYSNDDLFI SKEAQIRE 840 

Ml : : I : I I II I I : I II I : I I I I 

Db 2059 AESIKDEAEKSK EES RRE S AAEKS PLP S KEAS RPAS VAES VKDEADKS KEE S RR- 2112 

Qy 841 TETFSDSSPIEII — DEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCT 898 

| : : : I : I I : I I : : I : I I I I I : III: 

Db 2113 -ESMAESGKAQSIKGDQSPLKEVSRPES VAES VKDDPVKSK- EPS RRES VAGSVTAD 2167 

Qy 899 ELPHDLSLKNIQPKVEEK ISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE 949 

I :: I :| I I I: I I = " 

Db 2168 SARDDQSPLESKGASRPESWDSVKDEAEKQES RRESK 2205 

Qy 950 IESIVKPKVLEKEAEKKL PSDTEKEDR-S PSAI FSADLGKTSWDLLYW RDI 1000 

M : : I I : : : | : : : : I : I I I I : : I : : : II 

Db 22 06 TESVIPPKAKDDKSPKEVLQPVSMTETIREDADQPMKPSQAESRRESIAESIKASSPRDE 22 65 



Qy 1001 KKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFR 1060 

| M M : : | : : 1 I I I I I I 

Db 2266 KSP LASKEASRPGSV AESIKYDLDKPQIIKDDKSTE-HSRR 2305 

Qy 1061 AYLESEVAI-SEELVQKYSNSALGH 1084 

I I : I : I I : I : : I I 
Db 2306 ESLEDKSAVTSEKSVSRPLSVASDH 2330 



RESULT 9 
A47282 

calcium-binding protein calphotin - fruit fly (Drosophila melanogaster ) 
C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 18-Nov-1994 #text_change 21-Jul-2000 
C; Accession: A4 7282 

R;Martin, J.H.; Benzer, S.; Rudnicka, M. ; Miller, C.A. 
Proc. Natl. Acad. Sci. U.S.A. 90, 1531-1535, 1993 

A;Title: Calphotin: a Drosophila photoreceptor cell calcium-binding protein. 

A; Reference number: A47282; MUID: 93165729; PMID: 8094559 

A/Accession: A47282 

A; Status : preliminary 

A;Molecule type: nucleic acid 

A; Residues: 1-865 <MAR> 

A;Cross-references: GB:L02111; NID:gl57031; PIDN :AAA28405 . 1 ; PID:gl57032 
A; Experimental source: photoreceptor cells 

A;Note: sequence extracted f rom NCBI backbone (NCBIN : 124955 , NCBIP : 124956) 

C; Genetics : 

A; Gene: FlyBase:Cpn 

A;Cross-references : FlyBase : FBgn0010218 

C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl-terminal 
homology; von Willebrand factor type C repeat homology 
C;Keywords: calcium binding 

Query Match 5.5%; Score 324.5; DB 2; Length 865; 

Best Local Similarity 22.3%; Pred. No. 4.1e-06; 

Matches 229; Conservative 128; Mismatches 375; Indels 297; Gaps 49; 

AGL S AAP VP TAPAAGAP LMDFGNDFVP PAP RG P L PAAP P VAP ERQPSWDPSPVSS T VP 118 
: MM I I II: I III I: I I I |: I |:| 



I I : I : : M : I : I II 1 I I I : I I : I I I I I I I I : 

AP AP I AAAS VT P VAS VAP P WAAPT P P AA- S P VS T PVAVAQ I P VAVS AP VAP PVAAT 121 



Qy 


59 


Db 


9 


Qy 


119 


Db 


66 


Qy 


173 


Db 


122 


Qy 


233 


Db 


161 


Qy 


293 


Db 


195 



I : I II I : I I I : I I : I : 

>VIAT PPVAASA PT PAAVTPVISPVIAS — 160 



1:1 I | |: : | :: I :| 
- P PWPANTT VP VAAP VAAVP AAVP WAP VLAP AV 194 



1:11 I I I I : | : | : I I I : 

•APWAETPAPPPVAEI PVAT 1 PECVAPLI PEVS WA 234 



Qy 34 8 SEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVD 407 

: I : I I I I I | :: :|: : : I 

Db 2 35 T K P LAAAE PVWAP PAT ET PWAPAAASPHVSVAPAVETAWAPVS 28 0 

Qy 408 KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFP 467 

II : : I : I I I II 
Db 2 81 ASTEPPV AAATLTTAPETPAL — 301 

Qy 468 LLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

I : I I II I II 
D b 302 APWAESQ VAA NTWATPP 32 0 

Qy 52 8 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA 579 

I I I : I : I : I II : I I I : I I I i : I I 

Db 321 TPAPEPETIAPPW AETPEVASVAVA-ETTPPW — PPVAAESI-PAPWATTPV 371 

Qy 580 -AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHE 638 

I I : : I : I I I :: : I : I I I : I I : I I : 

Db 372 PATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVLPPVAAEPVPAWAEET 431 

Qy 639 PENPPPYEEAMSV-SLKVSGIKEEIKEPENIN7\ALQETEAPYISIACDLIKETKLSAEPA 697 

| I I I : :: : | : : I I : I I I : I : : I I I 

Db 432 P ET P AP AS AP VT I AALD I P EVAP VI AAP S DAP AEAP S AAAP I VS T P PT T AS VP ETT AP PA 491 

Qy 698 P DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPVDLFSDDSIP — DVPQK 744 

11111:111 I : I I : I I I = I -1 

Db 492 AVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDL 550 

Qy 745 QDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEV 804 

:: I : | :: : I :: M I ::' : : I 

Db 551 LEQTTSVPAVEAAESTSSPIPE TSLPPPNEAVASPEVAVAPITAPEPIPEPEP 603 

Qy 805 STLSKKEKIPLQ MEELSTAVYSNDDLFI SKEAQIRETETFSDSSPIEIIDEFPTLI 860 

| : | | | : : : : I II I : I I I III M I : 
Db 604 S LAT P T E P I P VEAP WI Q EAVD AV EVPVTETST SIP-ETTVEFPEAV 649 

Qy 861 SSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP HDLSL 906 

: | | I : I : : : : I I I I I Ml: 1 : : : 

Db 650 AEKV LDPAITEAPVTTQEPDVANINDGA PATEITTPAVEIVTAAAEVSDIAI 701 

Qy 907 KNIQPKVEEKI SFSDDFSKNGSATSKVLL LP PDVSALG 944 

| | | ::|: : : : : ::|:: :| I : I I 

Db 702 PVIDPPVPQEIAVA-EIPETDTKPAEVIVEQSTIPIEAPVPEVSKYAEPVISEAPAAEVP 760 

Qy 945 HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLY 996 

: | | : I I : II I : I : : : III I M : II 
Db 761 ITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD— SVPVAK— ITPLL- 814 

Qy 997 WRDIKKTGV 1005 

II:: I I 
Db 815 -RDLQTTDV 822 



RESULT 10 
A47283 

calphotin - fruit fly (Drosophila melanogaster ) 



C; Species: Drosophila melanogaster 

C;Date: 21-Sep-1993 #sequence_revision 25-Apr-1997 #text_change 21-Jul-2000 
C; Access ion: A47283 

R;Ballinger, D.G.; Xue, N . ; Harshman, K.D. 

Proc. Natl. Acad. Sci. U.S.A. 90, 1536-1540, 1993 

A;Title: A Drosophila photoreceptor cell-specific protein, calphotin, binds 
calcium and contains a leucine zipper. 

A; Reference number: A47283; MUID : 93165730 ; PMID: 8434015 

A; Contents: photoreceptor cells 

A;Accession: A47283 

A; Status: preliminary 

A;Molecule type: nucleic acid 

A; Residues: 1-873 <BAL> 

A; Cross-references: GB:L05080; NID:gl57071; PIDN : AAA28420 . 1 ; PID:gl57072 
A;Note: sequence extracted f rom NCBI backbone (NCBIN : 124958 , NCBIP : 124 959 ) 
C;Superfamily: collagen alpha 1(1) chain; fibrillar collagen carboxyl-terminal 
homology; von Willebrand factor type C repeat homology 

Query Match 5.4%; Score 320.5; DB 2; Length 873; 

Best Local Similarity 22.5%; Pred. No. 6e-06; 

Matches 235; Conservative 122; Mismatches 367; Indels 321; Gaps 50; 

Qy 59 P AAGL S AAP V- P T AP AAGAP LMD FGND FVP PAP RGP L P AAP PVAPERQPSW-DPSP 112 

| : I I I I I : I I i :: III I MM MM I : I 

D b 9 PVSAPVAAPWPSAVAAPVQWSPAAVAVAPAWAPAPAAPTAVTPVAP — PPTLASVQP 66 

Qy H3 VSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPP A 163 

: | | M | : | : : II : I : I II I III I II Ml 

Db 67 ATVTVPAPAPIAAASVAP VASVAPPWAAPTPPA ASPVSTPPVAVAQIPVAV 118 

Qy 164 PAPAAPP STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLET — 217 

Mill II I I : : I I I 

D b 119 SAPVAPPVAATPTPVAP IPVAAPVIATPPVAASAPT 154 

Qy 218 -AASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLT 276 

|| I :||: I Ml I II: =1 " I 

Db 155 PAAVTPWSPVIAT PPWPANTT VPVAAPVAAVPAAVPWAPVLA 199 

Qy 277 EFSELEYSEMGSSFSVSPKAESAVIVAN PREEI IVKNKDEEEKLVSNNI LHXQQE 331 

: I : I I : II I III 
Db 200 P AVAPAV — APWAET PAP P PVAE I P VAT 22 6 

Qy 332 LPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDM 391 

: I : I : I II I : I I I I I I 

Db 227 IPECVAPLIPEVSWAT KPLAAAEPVWAPPATET PWAPAAASPH 272 

Qy 392 LAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCA 451 

:: : I : : : I II = : | : I I 

Db 273 VSVAPAVETAWAPVS ASTEPPV— AAATLTTA 303 

Qy 452 PFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFV7VAQDS 511 

IN I M I Ml 
Db 304 PETPAL APWAESQ VAA 320 

Qy 512 ETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEV 571 

I I I I I : I : I MM MM : I I 

Db 321 NTWATPPTPAPEPETIAPPW AETPEVASVAVA-ETTPPW — PPV 3 64 



Qy 


572 


MQESLYPA AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSS 

II * - 1 • 1 1 1 • • • 1 • 1 1 1 * 1 
I | : i | ll * • l • l II •i*lll • I 

AAE SI- PAP WAT T P VP AT LAVT D P D VT AS AVP ELP P VI AP S P VP SAVAET P VD LAP P VL 


622 


Db 


365 


423 


Qy 


623 


SPLEASSVNYESIKHEPENPPPYEEAMSV-SLKVSGIKEEIKEPENINAALQETEAPYIS 

1.1 1 > Mil .... 1 . . | 1 | 11*1 


681 


Db 


424 


1 : 1 | . 1 1 1 1 .... | . . | | . | ll*l 

P PVAAE P VP AWAEET P ET PAPAS AP VT I AALD I P E VAP VI AAP S DAPAEAP SAAAP I VS 


483 


Qy 


682 


IACDLIKETKLSAEPAP DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPV 

■ ill ll 111*111 1 * 1 1 • 1 
: : 1 1 1 II 1 1 1 • 1 1 1 1 • 11*1 

T P P T T AS VP ET T AP P AAVP T E P I DVS VL S E-AAI ET P VAP P VEVT T EVAVAD VAP P EAAE 


730 


Db 


484 


542 


Qy 


731 


DLFSDDSIP — DVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESF 
ii • i . i • • i • i • • * i ••iti 


788 


Db 


543 


1 1 : 1 : I : : 1 : 1 : - • 1 ■ • If 1 

DLIIEPVEPPAPIPDLLEQTTSVPAVEAAESTSSPIPE -TSLPPPNEAVASPEV 


595 


Qy 


789 


KLSLDNTKDTLLPDEVSTLSKKEKI PLQ MEELSTAVYSNDDLFI SKEAQIRETETF 

. . 1 1 . 1 ! I . . • • 1 t 1 1 • 1 1 t 

: : : : II : Ml- • • 1 I 1 1 • 1 1 1 

AVAPITAPEPIPEPEPSLATPTEPIPVEAPWIQEAVDAV EVPVTETST- 


844 


Db 


596 


644 


Qy 


845 


SDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP — 

iii lit . . i i i- mi til* 
III III : : 1 1 1 - 1 • • * * 1 1 III III* 

— SIP-ETTVEFPEAVAEKV LDPAITEAPVTTQEPDVANINDGA PATEITTP 


901 


Db 


645 


693 


Qy 


902 


HDLSLKNIQPKVEEKI SFSDDFSKNGSATSKVLL LP PDVS 

i*. iii.. i... • • ..i.. * i 1*11 
I : : | | | : : | : : : : : . . | . . . | 1*11 

AVEIVTAAAEVSDTAIPLIDPPVPQEIAVA-EIPETETKPAEVIVEQSTIPIEAPVPEVS 


941 


Db 


694 


752 


Qy 


942 


ALG HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSA 

: 1 1 : 1 1 : 1 1 1 : 1 : : : III 
K YAE P VI S EAP AAE VP I TAGDN P DNT S VG I S E W- PT I AE K P VEE VP T S E I P EQ S S S P S D 


980 


Db 


753 


811 


Qy 


981 


IFSADLGKTSWDLLYWRDIKKTGV 1005 




Db 


812 


1 : 1 : II 1 1 : : 1 1 
--SVPVAK — ITPLL — RDLQTTDV 830 





RESULT 11 
T34418 

hypothetical protein F12F3.3 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 29-Oct-1999 

C;Accession: T34418 

R; Fulton, B.; Wohldmann, P. 

submitted to the EMBL Data Library, July 1998 

A; Description: The sequence of C. elegans cosmid F12F3. 

A; Reference number: Z21521 

A;Accession: T34418 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-3488 <FUL> 

A;Cross-references: EMBL:U80022; PIDN : AAC25885 . 1 ; GSPDB : GN00023 ; CESP:F12F3.3 

A; Experimental source: strain Bristol N2; clone F12F3 

C; Genetics : 

A; Gene: CESP:F12F3.3 

A;Map position: 5 



A;Introns: 281/3; 332/1; 562/3; 600/3; 1866/3; 1944/3; 3393/1 



Query Match 5.1%; Score 299.5; DB 2; Length 3488; 

Best Local Similarity 20.6%; Pred. No. 0.0003; 

Matches 272; Conservative 187; Mismatches 475; Indels 387; Gaps 57; 

Qy 1 MEDLDQSPLV--SSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEV— — 54 

::::| I :: I : :| | : ::: I I I : I I : : |::| 

Db 340 VDEVDDSTVLEEKKDDGDDKSKPKTKKKIIKKKETPESEQVTAAEPEQQKISEVDVQSVA 399 

Qy 55 LERKPAA GLSAAPVPT APAAGA PLMD FGNDF 85 

: : I I I III: II II 

Db 400 ETEVGAKKKPDAEKPTDLSKAKKDSKSKKSDEPEASTEEKSTTEKPTNDKTSKKSAEKKT 459 

Qy 86 VPPAPR GPLPAAPPVAPERQPSWDPSPVSSTVPA PSPLSAAAV 128 

II I I I I I :: I I I : I I : I : 

Db 460 VKPKKEVTGKPLEAKKPVEDKKDASQPSSSKESSPPTDGKKKKQIPKALFIPDEISSRFG 519 

Qy 12 9 SPSKL PEDDEPPARPP — PPPPASVS PQAEPVWTPPAP 164 

II: : I : I I I I I I : I I : : 

Db 520 DPSTMHSETNITTTIRGREGSADAKTPLVEPLSASVSMKVFTLVESAKEKAEFSFKRRSE 57 9 

Qy 165 AP AAPPSTPAAPK RRGSSGAWXXXXKIMD — — LKEQPGN 201 

I I |: : I : I: I:: I :| ::| I : 

Db 580 TPDDKSRKKEGLPPAKKSEKKDEVTAEKQSTEALIESKKKEVDESKISEQQPSDKNKSEV 639 

Qy 202 TISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVL PTEGTLQEN 254 

: I I I I I : : I : I I I : : : : 

Db 64 0 VGVPEKAAGPETKKDV — SEIEEVPKKKTIKKKTEKSDSSISQKSNVLKPADDDKSKSDD 697 

Qy 255 VSEASKEVSEKAKTLLID RDLTEFSELE YSEMGSSFSVSP 294 

I : : I I : : I : I I h I I : I I I : I 

Db 698 VTDKSKKTTEDQTKVATDSKLEKAADTTKQIETETWDDKSKKKVLKKKTEKSDSF-ISQ 756 

Qy 295 KAESAVIV — AN PRE EIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKED 343 

I : I : : I II : I | | : : : | | : I : : : I I I : I 
Db 757 KSETPPWEPTKPAESEAQKIAEWKAKKQKEVDDNL KREAEVAAKKI ADEKLKI EA 813 

Qy 344 EWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAG 395 

II :::| I: : :: I :: |: I : : : I : I I 
Db 814 EANI KKTAEVEAAKKQKEKDEQLKLETEWS KKSAAEKLELEKQAQI KKAAEAD AVK 870 

Qy 396 GKIESNLESKVD — KKCFAD — SLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCA 451 

: I I : : I : : II II I I : : I : I : I I I : : I 

Db 871 KQKELNEKNKLEAAKKSAADKLKLEEESAAKSKKVSEESVKF GEEKKTKAGEKTVQV 927 

Qy 452 PFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDS 511 

I : : : I I II I I : I II : I I I I I : I : : I I 

Db 92 8 ESEPTSKKTIDTKDVGATE-PADE — TPKKKIIKKK TEKSDSS ISQKS 972 

Qy 512 ETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGTK IAYETKMDL 565 

II : : : : I I : I : I I I : I : : I I I I : I I : 

Db 973 ATD SEKVSKQKEQDEPTKPAVSETQMVTEADKSKKQKETDEKLKLDAEIAAKTKQEA 1029 

Qy 566 VQTSEV-MQESLYPAAQLCPSFEESEATPSPVLPD-IVMEAPLNSAVPSAGASVIQPSSS 623 

: |:: II : : hi I I : :|: : : || :: : 

Db 1030 DEKSKLDAQEKIKKVS EDDAARKEKELNDKLKLESEIATKKASADKLKLEEQAQ 1083 



Qy 624 PLEASSVNYESIKHEPENPPPYE-EAMSVSLKVSGIKEEIKEPENINAALQETEAPYISI 682 

:|: I I: I : I : : : i | : | |::: I | | : 

Db 1084 AKKAAEV— EAAKKQKEKDEQLKLDTEAASKKAAAEKLELEKQAQIKKA AGADAVKK 1138 

Qy 683 ACDLIKETKLSAEPAPDFS DYSEMAKVEQPVPDHSEL VE 721 

:| :: II I : II :l I : ::l I 

Db 1139 QKELDEKNKLEANKKSAAGKLKIEEESAAKSKQTVEEQAKLDAQTKAKTAEKQTKLEKDE 1198 

Qy 722 DSSPDSEPVDLF SDDSI PDVP 742 

I : : I I : I I I I I 

Db 1199 KSTKESESKETVDEKPKKKVLKKKTEKSDSSISQKSETSKTWESAGPSESETQKVADAA 1258 

Qy 743 QKQDET VMLVKESLTETSFESMIEYENKEKLSA LPP 778 

: I I I I : I : I I : I : I I : I I : I I 
Db 1259 RKQKETDEKQKLEAEITAKKSADE KSKLEAESKLKKAAEVEAAKKQKEKDEQLKLDT 1315 

Qy 779 EGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKIPLQME 818 

I : I I I : I I : I : I I : I : : 

Db 1316 EAASKKAAAEKLELEKQSHIKKAAEVDAVKKQKELEEKQRLESE7^TKKADAEKLKLEEQ 1375 

Qy 819 ELSTAVYS NDDLFISKEAQIRETETFS DSSPIEIIDEFP 857 

: I : I : I I I I : I I I : : I I : I I I 

Db 1376 KKKAAEIALIEIQKEQEKLAQEQSRLEDEAKKSAEKQKLESETKSKQTEEAPKESVDEKP 1435 

Qy 858 — TLISSKTDSFSKLAREYTDLEVSHKSEIANAP-DGAGSLPCTELPHDLSLKNIQPKVE 914 

: : I I | : | : | M : I : I I : I I : I I III 
Db 1436 KKKVLKKKT EKSDSSISQKSKSAKSTVDAAETL ESDFNLVEKKTVQKVE 1484 

Qy 915 --EKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVK 955 

I : II I I : : I III : : I I 

Db 1485 QSPDESTSATIKRDPAQKTEEISKQDDGDEKKTTTDGKPPKPEDSEA TPKKRWK 1539 

Qy 956 PK VLEKEAEKKLPSDTEKEDRSPSAI FSADLGKTSWD 993 

I I I I : I I III I I I I I I : 

Db 1540 KKTQKSDSVASDASLADVSKLSDDVEEKPKKKVLKKKTEKSDSVISETSSVDTIKPESVE 1599 

Qy 994 L 994 

Db 1600 I 1600 



RESULT 12 
S37431 

ankyrin 2, neuronal long splice form -human 

N;Alternate names: ankyrin B, 440K splice form; ankyrin-B; brain ankyrin; non- 
erythroid ankyrin 

N;Contains: ankyrin 2, short form 
C; Species: Homo sapiens (man) 

C;Date: 06-Jan-1995 #sequence_revision 06-Jan-1995 #text_change 13-Aug-1999 
C;Accession: S37431; A39643; B39643; A40334; A49462; S14533; S14569 
R;Chan, W. 

submitted to the EMBL Data Library, September 1993 
A; Reference number: S37431 
A; Access ion: S37431 
A; Status: preliminary 
A;Molecule type: mRNA 



A; Residues: 1-3924 <CHA> 

A;Cross-references: EMBL:Z26634; NID:g406287; PIDN: CAA81387 . 1; PID:g406288 
R;Otto, E.; Kunimoto, M. ; McLaughlin, T.; Bennett, V. 
J. Cell Biol. 114, 241-253, 1991 

A; Title: Isolation and characterization of cDNAs encoding human brain ankyrins 

reveal a family of alternatively spliced genes. 

A; Reference number: A39643; MUID : 913024 66 ; PMID: 1830053 

A; Accession: A3 9 64 3 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-2077 <0T1> 

A; Cross-ref erences : GB:X56957 

A; Accession: B39643 

A; Status: preliminary 

A; Molecule type : mRNA 

A; Residues: 1-1443,3585-3924 <0TT> 

A; Cross-references : EMBL:X56958 

R; Tse, W.T.; Menninger, J.C.; Yang-Feng, T.L.; Francke, U. ; Sahr, K.E.; Lux, 
S.E.; Ward, D.C.; Forget, B.G. 
Genomics 10, 858-866, 1991 

A; Title: Isolation and chromosomal localization of a novel nonerythroid ankyrin 
gene . 

A; Reference number: A40334; MUID : 92 00992 1 ; PMID: 1833308 
A; Access ion: A40334 
A;Molecule type: DNA 

A;Residues: 463-474 , ' PE ', 477-4 95 <TSE> 

A;Cross-references: GB:M37123; NID:gl78647; PIDN : AAA62828 . 1 ; PID:gl78648 
R;Chan, W. ; Kordeli, E.; Bennett, V. 
J. Cell Biol. 123, 1463-1473, 1993 

A; Title: 440-kD ankyrinB: structure of the major developmentally regulated 
domain and selective localization in unmyelinated axons. 
A; Reference number: A49462; MUID : 940754 09 ; PMID: 8253844 
A;Accession: A49462 

A; Status: preliminary; nucleic acid sequence not shown 
A;Molecule type: mRNA 
A; Residues: 1-3924 <RES> 

A;Cross-references: EMBL:Z26634; NID:g406287; PIDN : CAA81387 . 1 ; PID:g406288 

C; Genetics: 

A; Gene: GDB : ANK2 

A/Cross-references: GDB: 127607; OMIM: 106410 
A;Map position: 4q25-4q27 

C; Superf amily : ankyrin; ankyrin repeat homology 
C; Keywords: alternative splicing 

F;2-3924/Product: ankyrin 2, long form ((status predicted <MAT> 

F;2-1443, 3585-3924/Product: ankyrin 2, short form #status predicted <MA2> 

F; 63-95/Domain: ankyrin repeat homology <AN01> 

F; 96-128/Domain : ankyrin repeat homology <AN02> 

F; 129-161/Domain: ankyrin repeat homology <AN03> 

F; 162-190/Domain: ankyrin repeat homology <AN04> 

F; 191-223/Domain: ankyrin repeat homology <AN05> 

F;232-264/Domain: ankyrin repeat homology <AN06> 

F;265-297/Domain: ankyrin repeat homology <AN07> 

F;298-330/Domain: ankyrin repeat homology <AN08> 

F;331-363/Domain: ankyrin repeat homology <AN09> 

F; 3 64- 3 9 6/ Domain : ankyrin repeat homology <AN10> 

F; 397-42 9/Domain : ankyrin repeat homology <AN11> 

F; 430-4 62 /Domain : ankyrin repeat homology <AN12> 



F; 463-495/Domain 
F; 4 96-52 8 /Domain 
F; 529-561/Domain 
F; 562-594/Domain 
F;595-627/Domain 
F; 628-660/Domain 
F; 661-693/Domain 
F; 694-726/Domain 
F;727-759/Domain 
F; 760-792/Domain 
F;793-825/Domain 



ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 
ankyrin repeat 



homology <AN13> 
homology <AN14> 
homology <AN15> 
homology <AN16> 
homology <AN17> 
homology <AN18> 
homology <AN19> 
homology <AN20> 
homology <AN21> 
homology <AN22> 
homology <AN23> 



Query Match 5.0%; Score 299; DB 2; Length 3924; 

Best Local Similarity 21.5%; Pred. No. 0.00037; 

Matches 258; Conservative 167; Mismatches 417; Indels 358; Gaps 62; 
Qy 14 DSPP RPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLE ELEVLERK 58 



Db 1648 DIPPDETQSTQKQHKPSLGIKKPVRRKLKEKQKQKEEGLQASAEKAELKKGSSEESLGED 1707 

Qy 59 PAAGLSAAPVPTAPAAGAPLMD- FGNDFVPPAPRG 92 

I I I : I : I I I : I I :": I :| :| 

Db 1708 P — GLAPEPLPTVKAT-SPLIEETPIGSIKDKVKALQKRVEDEQKGRSKLPIRVKGKEDV 1764 



Qy 93 PLPAA-PPVAPERQPSWDPSP VSSTVPAPSPL 123 

11111:11 III II II: 

Db 1765 PKKTTHRPHPAASPSLKSERHAPGSPSPKTERHSTLSSSAKTERHPPVSPSSKTEKHSPV 1824 



Qy 124 SAAA VSPSKLPEDDEPPA RPPPPPPASVSPQAEPVW 159 

I : I I I I I I I : | | | | : : : M 

Db 1825 SPSAKTERHSPASSSSKTEKHSPVSPSTKTERHSPVSSTKTERHPPVSPSGKTDKRPPV- 1883 

Qy 160 TPPAPAPAAPPSTPAAPKRR GS SGAWXXXXKIMDLKEQPGNTI SAGQEDFPSVLLE 216 

: I I I : I : : I II I I I : I : : I 

Db 1884 SPSGRTEKHPPVSPGRTEKRLPVSPSGRTD KHQPVSTAGKTEKHLPVSPSG 1934 



Qy 217 TAASXPSLSPLSAAS-FKEHEYLGNLSTVLPT EGTLQENVSEASKEVSEKAKT 2 68 

I : I I I : I : I : : | | : | hill 

Db 1935 KTEKQPPVSPTSKTERIEETMSVRELMKAFQSGQDPSKHKTGLFEHKS7VKQKQPQEKGKV 1994 

Qy 269 LLIDRDLTEFSELEYSEMGSSFSVSPKAES — AVIVANPREEIIVKNKDE — 316 

: I t : : : I : : I I : III I I : : I I I : I 

Db 1995 RVEKEKGPILTQRE-AQKTENQTIKRGQRLPVTGTAESKRGVRVSS 1 GVKKEDAAG 2049 



Qy 317 -EEKLVSNNI LHXQQELPTALTKLVKE DEW 34 6 

: I I : : I : I I : I : I hi I h 

Db 2050 GKEKVLSHKIPEPVQSVPEEESHRESEVPK — EKMADEQGDMDLQI S PDRKTSTDFSEVI 2107 



Qy 347 S S E- KAKDS FN E K RVAVEAPM R E E Y AD FKPFERVWEVKDSKED 388 

I : I : : h : I : : I II : : I : : 

Db 2108 KQELEDNDKYQQFRLSEETEKAQLHLDQVLTSPFNTTFPLDYMKDEFLPALSLQSGALDG 2167 



Qy 389 -SDMLAAGGKIESNLESKVDKKCFADSLEQTNHE KDSESSNDDTSFPSTPEGIKDRS 444 

hi I I I : : II II : | : | : | | : | : : : : : 

Db 2168 SSESLKNEGVAGSPCGSLMEGTPQISS EES YKHEGLAET PETS PES LSF — SPKKSEEQT 2225 

Qy 445 GAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPF 504 



i • ii i i i • • • i i i - i . 

Db 2226 G ETKESTKTETTTEIRSEKEHPTTKDITGGS — EERGATVTEDSETSTESFQKE 2277 

Qy 505 FVAAQDSETDYWTDNLTKVTEEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKM 563 

: I: I :| I III :|| I |: |::| 

Db 2278 ATLGSPKDTSPKRQDDCTGSCSVALAKETPTGLT EEAACDEGQRTFGSS-AHKT — 2330 

Qy 564 DLVQTSEVMQESLYPAAQLCPSFEESEATPSPVL PDIVMEAP— LNSAV 610 

M III : : | : : | | | I I : : I I I : 

Db 2331 QTDSEAQES TATSDETKALPLPEASVKTDTGTESKPQGVIRSPQGLELAL 238 0 

Qy 611 PSAGASVIQ PSSSPLEASSVNYESIKHE-PEN--PPPYEEA-MSVSLKVSG 657 

II : I = I I I I I : : I : I : : I I : I : I I : I 

Db 2381 PSRDSEVLSAVADDSLAVSHKDSLEAS PVLEDNSSHKTPDSLEPSPLKESPCRDSLESSP 244 0 

Qy 658 IKEEIKE PEN — INAALQETE — APYI SIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

: : : : I I : ': I I : : I I I : I : : : I I I : I : I 

Db 2441 VEPKMKAGIFPSHFPLPAAVAKTELLTEVASVRSRLLRD PDGS--AEDDSLE 2490 

Qy 711 QPVPDHSELVEDS SPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I : I : I I III: : I : I :: | | 
Db 2491 Q TSLMESSGKSPLSPDTPSSEEVSYEVTPKTTDVSTPKPAVIHECAEED 2539 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKIPLQMEELS 821 

: I I I I II III: I | : | | : : | : : I I I I I 

Db 2540 DSENGEKKRFTPEE EMFKMV TKIKMF-DELEQEAKQKRDYKKEPKQEESSS 2589 

Qy 822 TAVYSNDDLFISKEA-QIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 880 

: I : I | : : : | : | | : I I : : I : : I : 
Db 2590 S SDPDADCSVDVDEPKHTGSGEDESGV PVLVTSESRKVSSSSES 2633 

Qy 881 HKSEIANAPDGAGS LPCTELPHDLSLKN IQPKVEEKISF — SDD 922 

: I : I III I : | | : : I I I : : : I : : I 

Db 2634 -EPELAQLKKGADSGLLPEPVIRVQPPSPLPSSMDSNSSPEEVQFQPWSKQYTFKMNED 2692 

Qy 923 FSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSD TEKEDRS 977 

= : : I I : : I | | I I I I I I 

Db 2693 TQEEPGKSEE EKDSESHLAEDRHAVSTEAEDRS 2725 



RESULT 13 
T16251 

hypothetical protein F35A5.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 18-Feb-2000 
C; Accession: T16251 
R;Leimbach, D. 

submitted to the EMBL Data Library, January 1996 

A; Description: The sequence of C. elegans cosmid F35A5. 

A;Reference number: Z18485 

A; Accession: T16251 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-1274 <LEI> 

A;Cross-referenc.es: EMBL:U46675; NID : gll66613 ; PID : gll66621 ; PIDN : AAB52641 . 1 ; 
GSPDB:GN00028; CESP:F35A5.1 

A; Experimental source: strain Bristol N2 ; clone F35A5 



C; Genetics: 
A; Gene: CESP:F35A5.1 
A; Map position: X 
A;Introns: 1272/2 

Query Match 4.9%; Score 290.5; DB 2; Length 1274; 

Best Local Similarity 20.5%; Pred. No. 0.00017; 

Matches 221; Conservative 126; Mismatches 417; Indels 313; Gaps 45 

Qy 13 SDSPPRP — QPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPT 70 

I : I I I III:: I I I I I I I I I : I I : I I : I , 

Db 2 SRAPPTPIKNPAKKWKPPWESVDEEEEME VDEETPAPSKLEKKPSLKRKDAPTKP 56 

Qy 71 APAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSP 130 

hill I : I I I : I I I : I I | 
Db 57 VPSPGAP SPVPIKNPVKKWKAPWEDDEPMEEAPAAP VP 94 

Qy 131 SKLPEDDEP PARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWX 187 

: I II I I : I I : : I I I I I I I 

Db 95 AKKVRDP S PKKVPAKPRDAS PKKIMAAKK EPETLPAVPP-TPVKNPVKKFKAPWED 149 

Qy 188 XXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXP-SLSPLSAASFKEHEYLGNLSTVLP 246 

: I : I : I I : I : I I : I II II III: : | | 

Db 150 DEVDVEDVKDAP — TVPAKKTPVLKKKEPAAAAKPRDPSPKKAAPSKEHDPI VPP 2 02 

Qy 247 TEGTLQENVS EASKEVSEKAKTLLIDRDL — TEFSELE YSEMGSSFSVSPKA 296 

I I : : | | | : : | | | : | | | | : 

Db 203 T PIKNPAKKWKPPWEDDEVPTEEIKEPEPATRKVPALKKKEPSTSVKPVS 252 

Qy 297 ESAVIVANPREEIIVKNKDE EEKLVSNNILHXQQELPTALTKL 339 

: :| ::: | | : | | :: : : : | : 

Db 253 D PSPTKKVPVKKEPEVPPTPIKNPTKKWKPPWEDETPVEEV— KEPPVPEKKAPV 305 

Qv 340 VKEDEWSSEKAKDSFNEKR VAVEAPMREEYADFKPFERVWEVKDSKEDS 389 

= I : : : I I : I I I I : : : | | | | I I : 

Db 306 LKKKDPAPAAKARDPSPSKAAPKKVEPSSPWPPTPVKNPVKKYKP PWEVDDEPAE- 361 

Qy 390 DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYIT 449 

I I :M I:: I I : : I I :| | : 

Db 362 EVKKPSAPEKK — TPVLKRKEPEPSSTTPSSDPSPKKAAPAVKPRDSSPKK 410 

Qy 450 CAPF — NPAATESIATNIFPLLEDPTSE NXTDEKKIEE-KKAQIVTEKNTSTKTS 501 

I : I I I I : : : I : I I : I I I : : : I I 

Db 411 AT P LQAD P KAQ EVP P T P V KNPVKKYKPPWEVDDEDPVEEVKQPEAPAKKTPVLKRK 466 

Qy 502 NPFFVAAQDSETDYVTTDNLTKVTEEVVANMPE GLTPDLVQ 542 

I I I : I I III II Ml 

Db 467 EP AAKD TAKPATSKTPET P EKKD P VK P RD S S P KKVAAK P D S AQAP AT 513 

QY 543 EACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEES 589 

I I : I : I : I : : I I I I I : 

Db 514 PVKNPVKKWRPPWEDDETPADDVSKPTDAKKTPSLAKKDPAPAKESLKPKADTKAPAKPR 573 

Qy 590 EATPSPVLPDIVMEAPLNSAVP3AGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAM 649 

: : I I I I I : I I I I : I : I I : 

Db 574 DPSPKKVAP TAPEKKTPVLAKKEPAGPADSKTKEPEKSKPRDPSPKKAVP 623 



Qy 650 S VSLKVSGIK— EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

• : : : I I I : | : : I : i : | -| 
624 AKPVPKTEVAPAAVKKPEPISKPKDTAPKKAEPNSPW PPTP 665 



Db 



7 03 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 
1= II M ::|| I I: : :| :| |:: |: | 

Db 666 VKNPVKKWKPPWEDDDAPAKPVSL PEPEKKTPVLAKKAPTKPDSE 710 

Qy 7 63 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 
Db 711 AAADPVSGP 719 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEII DEFPTLISSKTDSFSKLAREYT 875 

I : I : : I : I : : III::: I | : | : : | 
Db 720 SSKDPKLAKKAPVKP RD P S PMKAVP I KPAP KT EVP P AWKK P E P VAK 766 

Qy 876 DLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVE EKISFSDDFSKNGSA 929 

: I I I I : :| | | : I : I I | :: : | 

Db 767 SRDPSPKK — AKAEPNSPWPPT — PVKNPVKKWKPPWEDDDAPAEPVNVPEPEKKTPVL 822 

Qy 930 TSKVLLLPPDVS ALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSP 978 

l=IM II I : I | : : | | II : I I [ 
Db 823 AKKT PVKPRDPSPKKAVPAKPSTKTDAPPVSVKKPEPVSKPKE PSPKKAEPNSP 876 



RESULT 14 
A56577 

microtubule-associated protein MAP IB - rat (fragment) 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 21-Jul-1995 #sequence_revision 21-Jul-1995 #text_change 16-Feb-1997 
C;Accession: A56577 

R;Zauner, W. ; Kratz, J.; Staunton, J.; Feick, P.; Wiche, G. 
Eur. J. Cell Biol. 57, 66-74, 1992 

A; Title: Identification of two distinct microtubule binding domains on 
recombinant rat MAP IB. 

A; Reference number: A56577; MUID : 92347374 ; PMID: 1639092 

A;Accession: A56577 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-2364 <ZAU> 

A; Cross-references : GB:X60550 

A; Experimental source: brain 

A;Note: nucleotide sequence not given; conceptual translation not complete 
C; Superf amily : microtubule-associated protein MAP IB 

Query Match 4.9%; Score 289; DB 2; Length 2364; 

Best Local Similarity 20.9%; Pred. No. 0.00047; 

Matches 275; Conservative 158; Mismatches 462; Indels 422; Gaps 61; 

Qy 30 EPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPA 89 

I Ml: :: II :: | Mill : ||: 

Db 909 EAEQSEEEGEEEEDKAEDAREEDHEPDKTEAEDYVMAWDKAAEAGVTEDQY — DFL 963 

Qy 90 PRGPLPAAPP— VAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 147 

M I :l hi: I hi I || |:| | 

Db 964 — GTPAKQPGVQSPSREPA— SSIHDETLPGGSESEATA SDEENREDQPEEF 1011 



148 PAS VS PQAE P VWT P P AP AP AAP P S T P AAP KRRG S S GAWXXXXK IMDLKEQ P GNT I S AGQ 207 
I • : I I I I I : | : | | | 

Db 1012 TATSGYTQSTIEISSEPTPMDEMSTP RDVMTDETNNEETESPSQ 1055 

208 E DFPSVXLETAASXP SLSPLSAASFKE HEYLGNLSTVLPTEGTLQENV 255 

I : I I II I : II I : : I : I 1 : I :: 

Db 1056 EFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTISPPSSMEEDKF 1115 

Qy 256 SEAS KEVSEK AKTLLIDRDLTE 277 

1 ::: i = M :: II I : I : 

Db HI 6 SKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-N 1174 

Qy 278 FS ELEYSEMGSSFS-VSPKAESAVI VAN PRE EIIVKNK 314 

M I : : I I : : I I I I I : I : I I | : : : : 

Db 1175 FSLTPNEIKASAEGEATAWSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYY 1234 



Qy 3i5 DEEEKLVSNNILHXQQELPTALTKLVKEDEVVSSE--KAKDSFNEKRVAVEAPMRE 368 

II- : : 1:1 I I I : I I I I I : : : M I 

1235 QSPTDEKSSHLPTEVTENAQAVP VSFEFTEAKDE-NER— SSISPMDE 1279 



Db 



Qy 369 EYADFK-PFERVWE VKDSKED SDMLAAGGKIESNLESKVDKKCFADSL 415 

1 : I I : I : : | | : I I I : I I I I | : | : | 

Db 1280 PVPDSESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSD-- 1337 

QY 416 EQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNP AATESIATNIFPLLED- 471 

I : I I I I I : : I : I I : | | : | : : | | 

Db 1338 KESPVS-DLTSDLYQDKQEEKRAGFIPIKEDFSPEKKASDAEIMSSQSALALDE 1390 

QY 472 PTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

M:: : I : : I I I : : | | : | : | : s 

Db 1391 RKLGGDGSPTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT— YSHMEGVAS 1447 

Qy 524 VTEEWA— NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAA 580 

> : M : II I I I : :|: |:: : : ||| ||: : |: 

Db 1448 VSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSK 1506 

Qy 581 QLCP SFEESEATPSPVLPDIVMEAPLNSAV P 611 

= 11 :: : I : : |:| :| | 

Db 1507 EECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHP 1566 

Qy 612 SAGASVIQ PSSSPLEASSVNYESIKHE— PENPPPY EEAMSVS-LKVSG 657 

: II :: l ; I : h I: I II I :||| :: | 

Db 1567 TVGAGMLHITENGPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLSELISVSQVEASP 1626 

Qy 6 58 IKEEIKEPENINAALQETE APYISIACDLIKE TKLSAEP 696 

11:111 : I hi : :: Ml : 

Db I 627 STSSAHTPSQIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKLSPKSDISPLTPR 1686 

Qy 697 APDFSDYSEMAK VEQPVPDHSELVEDS 723 

=1 Ml : M :| | :: |: 

Db 1687 ESSPTYSPGFSDSTSGAKESTAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQHHLALSRD 1746 

Qy 724 SPDSEPVDLFSDD SIPDVP QKQD 746 

IMIII: II : | : 

Db 1747 LTTSSVEKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYESHEKTIQAHDVGGYYYEKTE 1806 



Qy 



747 ETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKD 797 



Db 



EISEK 1858 



QY 798 TLLPDEVS — TLSKKEKIPLQMEELSTAVYSNDD LFISKEA 836 

I llllll: : : : : | : | | , . 

Db 1859 TTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESES 1918 

8 37 QIRETETFSDSSP IEIIDEFPTLISSKTDSFSKLAREYT DL 877 

II I : II I ::: : | |: | || | | 

Db 1919 YSYETTTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTPERKSPSEARQDVDL 1978 

^ 87 8 EVSH-KSEIANA— PDG AGSLPCTELPHDLSLKNIQP KV 913 

I I I : I : : : I : I I I I I : | : 

Db 1979 CLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESERPLTQSGGAPPPSGGKQQGRQC 2038 

Qy 914 EEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKK-LPSD 970 

; l I i I I II: IN I: | | : :|:| 

Db 2039 DETPPTSVSESAPSQTDSDV— PPETE ECPSITADANLDSEDESETIPTDKT 2088 

Q y 971 TEKEDRSPS AIFSADLGKTSWDLLYWRDIKKTG 1004 

•It'll : I I I I | Ml 

Db 2089 VTYKHMDPPPAPMQDRSPSPRHPDVSMVDPEALAIEQNLGKALKKDLKEKAKTKKPG 2145 



RESULT 15 
QRMSP1 

microtubule-associated protein MAPlB - mouse 

N; Alternate names: microtubule-associated protein MAPI (X) ; microtubule- 
associated protein MAPI. 2; microtubule-associated protein MAPS 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text change 01-Sep-2000 
C;Accession: S07549; S44387; A33645 
R;Noble, M. ; Lewis, S.A.; Cowan, N.J. 
J. Cell Biol. 109, 3367-3376, 1989 

A; Title: The microtubule binding domain of microtubule-associated protein MAPlB 

contains a repeated sequence motif unrelated to that of MAP 2 and tau. 

A; Reference number: A33645; MUID: 90094539; PMID:2480963 

A;Accession: S07549 

A; Molecule type: mRNA 

A; Residues: 1-24 64 <NOB> 

A; Cross-references: EMBL:X51396; NID:g52999; PIDN : CAA357 61 . 1 ; PID:g53000 
R;Sanchez, C; Padilla, R. ; Paciucci, R. ; Zabala, J.C.; Avila, J. 
Arch. Biochem. Biophys . 310, 428-432, 1994 

A; Title: Binding of heat-shock protein 70 (hsp70) to tubulin. 

A; Reference number: S44387; MUID: 94234720; PMID: 8179328 

A; Accession: S44387 

A; Status: preliminary 

A;Molecule type: protein 

A;Residues: 653-663, f IC ? <SAN> 

C; Superf amily: microtubule-associated protein MAPlB 

C; Keywords: microtubule binding; phosphoprotein; tandem repeat 

F;589-786/Domain: microtubule binding #status experimental <MTB> 

F; 589-592, 639-642, 649-652, 655-658, 660-663, 668-671, 674-677 , 679-682 , 683-68 6, 687- 

690,691-694,695-698,699-702,708-711,712-715,716-719,720-723,727-730,758-761,764- 
767, 783-786/Region: 4-residue repeats (K/R-K-E/D-X) 
F;1861-2064/Region: 17-residue repeats 



F; 91,116, 351, 888, 1124, 1153, 1168, 1208, 1662, 1877, 1918, 2003,2030, 2054, 2083/Binding 
site: phosphate (Ser) (covalent) #status predicted 

F; 147, 969, 1336, 1562, 1563, 1702, 1708, 1990, 2057, 2063, 2419/Binding site: phosphate 
(Thr) (covalent) #status predicted 

F;1953/Binding site: phosphate (Tyr) (covalent) #status predicted 

Query Match 4.9%; Score 289; DB 1; Length 2464; 

Best Local Similarity 19.8%; Pred. No. 0.0005; 

Matches 262; Conservative 148; Mismatches 474; Indels 442; Gaps 55; 

32 EDEEEEEEEEEEDEDEDL-EELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAP 90 
I I : Ml MM: II M : : | | | , : , 

Db 1009 EAEQSEEEGEEEDKAEDAREEGYEPDKTEAEDYVMAVADKAAElAGvTEEQYGY 1061 



^ 91 RGPLPAAPPV-APERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPA 149 

I I : : I | : | : | | : | | | | | . | , , 

1062 LGTSAKQPGIQSPSREPA — SSIHDETLPGGSESEATA SDEENREDQPEEFTA 1112 



Db 



Qy 150 S VS PQAE PVWT PPAPAP AAP P ST PAAP KRRGS S GAWXXXXKIMDLKEQ PGNT I S AGQED 209 

II III : | : | | M 

Db 1113 TSGYTQSTIEISSEPTPMDEMSTP RDVMSDETNNEETESPSQEF 1156 

Qy 210 FPSVLLETAASXPSLSPLSAASFK EHEYLGNLSTVLPTEGTLQENVSE 257 

M: I : Ml :| : M: I :: |: 

Db 1157 VNITKYESSLYSQEYSKPAVAS FNGLSEGSKTDATDGKDYNASASTISPPSSMEEDKFSK 1216 

Qy 258 AS KEVSEK— AKTLLIDRDLTEFS 279 

: : MM:: I I I : I : I I 

Db 1217 SALRDAYCSEEKELKASAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-NFS 1275 

280 ELEYSEMGSSFSVSPKAESAVI VAN PRE EIIVKNK 314 

I" I I : MM M: |:| | |:: :: 

Db 1276 LTPNEIKVSAEGEARSVSPGVTQAWEEHCASPEEKTLEWSPSQSVTGSAGHTPYYQSP 1335 



Qy 315 "DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADF 373 
M : : : I = I : MM II: I : I I I | 

1336 TDEKS SHLPTEVSENAQAVPVS F EFSEAKDE-NER— ASLSPMDEPVPDS 1382 



Db 



Q y 374 K-PFERVWE VKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSE 425 

= 11:1 : I I : | | : | , 

1383 ESPVEKVLSPLRSPPLLGSESPYEDFLSADSKVLGR RSESPFE 1425 



Db 



Qy 426 SSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXT 478 

I M : I M I : I MM I | 

Db 1426 GKNGKQGFPDRESPVSD LTSTGLYQDKQEEKSTGFIPIKEDFGPEKKTSDVETMS 1480 

Qy 479 DEKKI EEKKAQI VTEKNTSTKTSNPFFVAAQDSETD 514 

1 1 M : : I : : | I I : : I I : I 

Db 1481 SQSALALDERKLGGDVSPTQIDVSQFGSFKEDT?CMSISEGTVSDKSATP--VDEGVAEDT 1538 

Qy 515 YVTTDNLTKVTEEWA NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVM 572 

I : : I : M : I I I I I : : | : | : : : : | | | 

Db 1539 YSHMEGVASVSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTF 1597 

Qy 573 QES-LYPAAQLCP s FEES EAT PSPVLPDI VMEAPLNSAV-- 610 

II: : I: : || :: : | : : |:| :| 

Db 1598 QETEMSPSKEECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSFAMD 1657 



<2y 611 P SAGAS VI Q PSSSPLEASSVNYESIKHE--PENPPPY EEAM 649 

I: MM: I: : h |: I II I : 

Db 1658 FSRQSPDHPTLGASVLHITENGPTEVDYSPCDIQDSSLSHKIPPTEEPSYTQDNDLSELI 1717 

Qy 650 SVS-LKVSGIKEEIKEPENINAALQETE APYISIACDLIKE TKLSAE 695 

I M : : I 11:111 : | | : | : : : | | | : 

Db 1718 SVSQVEASPSTSSAHTPSQIASPLQEDTLSDWPPREMSLYASLASEKVQSLEGEKLSPK 1777 

QY 696 P APDFSDYSEMAK VEQPVPDHSELVEDS- 723 

: I I I I : I I : I I : : I : 

Db 1778 SDISPLTPRESSPLYSPGFSDSTSAAKETAAAHQASSSPPIDAATAEPYGFRSSMLFDTM 1837 

Qy 724 SPDSEPVDLFSD DS 737 

M I I I I | 

Db 1838 QHHLALNRDLTTSSVEKDSGGKTPGDFNYAYQKPENAAGSPDEEDYDYESQEKTIRTHDV 1897 

QY 738 I PDVPQKQDETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESF 788 

: : I : i : I I : I : I II : I III I 
Db 18 98 VRYYYEKTERTIKSPCDSGYSYETIEKTTKTPEDGGYTCEITEKTTRTPEEGGYSY 1953 

QY 789 KLSLDNTKDTLLPDEVS — TLSKKEKIPLQMEELSTAVYSNDD 829 

: :: I III I I |: ::::| :| 

Db 1954 EISEKTTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEK 2009 

QY 830 LFISKEAQIRETETFSDSSP IEIIDEFPTLISSKTDSFSKLAREYT 875 

I I: II I : II I ::: : | |: | || 

Db 2010 ITSFPESESYSYETSTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTTEKKSP 2069 

QY 876 DL — EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNIQP 911 

M I | |:|:: : |: II I I |: | 

Db 2070 SEARQDVDLCLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESEKPLTQSGGAPPPS 2129 

QY 912 KVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAE 964 

: : I II Mil: IN : : I | 

Db 2130 GGKQQGRQCDETPPTSVSESAPSQTDSDV PPETE ECPSITADANIDSEDE 2179 

QY 965 KK-LPSD TEKEDRSPS AI FSADLGKTSVVDLLYWR 998 

: : I : I =11111 :||| II 

Db 2180 SETIPTDKTVTYKHMDPPPAPMQDRSPSPRHPDVSMVDPDALAVDQNLGKAVKKDLKEKT 2239 

Qy 999 DIKKTG 1004 

I I I 

Db 2240 KTKKPG 2245 



Search completed: September 29, 2004, 18:16:03 
Job time : 39.8322 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 



September 29, 2004, 18:16:15 ; Search time 126.45 Seconds 

(without alignments) 
2997.869 Million cell updates/sec 



US-09-830-972-29 
Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 



Scoring table : 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1351062 seqs, 321799191 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1351062 



Database 



Published_Applications AA:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 



/cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 
/ cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 
/cgn2_6/ptodata/2/pubpaa/US0 6_NEW_PUB . pep : * 
/cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 
/cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 
/cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB . pep : * 
/cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep: * 
/cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 
/ cgn2_6/ptodata/2/pubpaa/US 09A__PUBCOMB . pep : * 
/cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: 
/cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: 
/cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB . pep : * 
/cgn2_6/ptodata/2/pubpaa/USlOA_PUBCOMB.pep: 
/ cgn2_6/ ptodata/2 /pubpaa/US10B_PUBCOMB . pep : 
/ cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: 
/cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 
/ cgn2_6/ptodata/2 /pubpaa/US60_NEW_PUB . pep : * 
/ cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB . pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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5 


5810 


98. 


1 


1192 


9 


US-09-893-348-23 


Sequence 2 3 , Appl 


6 


5810 


98. 


1 


1192 


12 


US-10-267-502-429 


Sequence 429, App 


7 


5810 


98. 


1 


1192 


16 


US-10-327-213-9 


Sequence 9, Appli 


8 


5810 


98. 


1 


1192 


16 


US-10-466-258-9 


Sequence 9, Appli 


9 


5794 


97. 


8 


1192 


12 


US-10-408-967-7 


Sequence 1, Appli 


10 


4296.5 


72. 


5 


1163 


9 


US-09-893-348-18 


Sequence 18 , Appl 


11 


4277.5 


72. 


2 


1163 


12 


US-10-267-502-431 


Sequence 431 , App 


12 


1495.5 


25. 


2 


373 


9 


US-09-789-386-6 


Sequence 6, Appli 


13 


1495.5 


25. 


2 


373 


9 


US-09-765-205-6 


Sequence 6, Appli 


14 


1495.5 


25. 


2 


373 


9 


US-09-893-348-24 


Sequence 24, Appl 


15 


1495.5 


25. 


2 


373 


12 


US-10-408-967-8 


Sequence 8 , Appli 


16 


1495.5 


25. 


2 


373 


14 


US-10-060-036-72 


Sequence 72 , Appl 


17 


1487.5 


25. 


1 


373 


16 


US-10-466-258-4 


Sequence 4 , Appli 


18 


1417 


23. 


9 


289 


9 


US-09-789-386-4 


Sequence 4 , Appli 


19 


1225.5 


20. 


7 


379 


14 


US-10-205-194-164 


Sequence 164 , App 


20 


1187 


20. 


0 


360 


9 


US-09-893-348-20 


Sequence 2 0, Appl 


21 


931 


15. 


7 


199 


9 


US-09-893-348-25 


Sequence 25 , Appl 


22 


931 


15. 


7 


199 


12 


US-10-660-946-1 


Sequence 1, Appli 


23 


924 


15. 


6 


199 


12 


US-10-408-967-9 


Sequence 9, Appli 


24 


922 


15. 


6 


199 


12 


US-09-978-360A-467 


Sequence 4 67, App 


25 


908 


15. 


3 


199 


9 


US-09-893-348-21 


Sequence 2 1 , Appl 


26 


890 


15. 


0 


199 


16 


US-10-466-258-11 


Sequence 11, Appl 


27 


791 


13. 


4 


111 


14 


US-10-205-219-93 


Sequence 93, Appl 


28 


787.5 


13. 


3 


776 


12 


US-10-660-946-5 


Sequence 5, Appli 


29 


787.5 


13. 


3 


776 


12 


US-10-267-502-430 


Sequence 430, App 


30 


779 


13. 


2 


780 


12 


US-10-267-502-432 


Sequence 432 , App 


31 


705 


11. 


9 


356 


12 


US-10-660-946-6 


Sequence 6, Appli 


32 


704 


11. 


9 


593 


15 


US-10-108-260A-2892 


Sequence 2892, Ap 


33 


688 


11. 


6 


208 


12 


US-10-660-946-7 


Sequence 7 , Appli 


34 


671 


11. 


3 


267 


12 


US-10-660-946-8 


Sequence 8, Appli 


35 


671 


11. 


3 


267 


14 


US-10-205-194-127 


Sequence 127 , App 


36 


630 


10. 


6 


266 


12 


US-10-276-774-2330 


Sequence 2330, Ap 


37 


625.5 


10. 


6 


236 


9 


US-09-729-674-20 


Sequence 20, Appl 


38 


625.5 


10. 


6 


236 


9 


US-09-765-205-26 


Sequence 2 6 , Appl 


39 


625.5 


10. 


6 


236 


12 


US-10-408-967-2 


Sequence 2 , Appli 


40 


625.5 


10. 


6 


269 


14 


US-10-106-698-6222 


Sequence 6222, Ap 


41 


541.5 


9. 


1 


168 


10 


US-09-809-391-563 


Sequence 563, App 


42 


541.5 


9. 


1 


168 


10 


US-09-882-171-563 


Sequence 563 , App 


43 


541.5 


9. 


1 


168 


12 


US-10-164-861-563 


Sequence 563 , App 


44 


520 


8. 


8 


222 


12 


US-10-267-502-428 


Sequence 42 8 , App 


45 


513 


8. 


7 


241 


12 


US-10-660-946-3 


Sequence 3, Appli 



ALIGNMENTS 



RESULT 1 
US-09-758-140-6 

; Sequence 6, Application US/09758140 
; Patent No. US20020012 965A1 
; GENERAL INFORMATION: 

; APPLICANT: Strittmatter , Stephen M. 



; TITLE OF INVENTION: No. US2 0020012965Alo Receptor-Mediated Blockade of 
Axonal Growth 

; FILE REFERENCE: 4 4 574-5073-US 

; CURRENT APPLICATION NUMBER: US/09/758,14 0 

; CURRENT FILING DATE: 2001-01-12 

; PRIOR APPLICATION NUMBER: US 60/175,707 

; PRIOR FILING DATE: 2000-01-12 

; PRIOR APPLICATION NUMBER: US 60/207,366 

; PRIOR FILING DATE: 2000-05-26 

; PRIOR APPLICATION NUMBER: US 60/236,378 

; PRIOR FILING DATE: 2000-09-29 

; NUMBER OF SEQ ID NOS : 20 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 6 

LENGTH: 1192 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-758-140-6 

Query Match 98.2%; Score 5815; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 6.1e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I M I I M I II I I I I I I II I I I I I I I I I | | | | M I I I I I I I I I I I I I I I I I I I I | | | | | | 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPT AP AAGAP LMD FGND EVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I I M I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | M I I I I I I I I I I I I I I I I 
Db 61 AGL S AAP VPT AP AAGAP LMD FGND FVP PAP RG P L PAAP P VAP ERQ P S WD P S P VS S T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I M II I I I I I I I I I I I I I I M I I I I I | | | | | | | | | | | | | | | | | | | | | | || | | | | | | || 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

M I : I : = I I M I I I I I I II I I I I I || I I I I | | | | | || | | | 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I II I I I I I I I I I M I I I I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FS VS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 34 6 

M I I I I I I I I I I I I I I I II I I I I i I I I I I I I I I I I I I I I I I I I I I I I I II I I I 1 I I I I I 
Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I 1 I I II I I I I I I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 42 0 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I M I I II I I II I I I I II I I I I I I I | I I | | | | | | | | | | M | | | || | | | | | | | 

Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 

Qy 467 PLLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 
HI I II I I I I I I I I II I I I II I I I I I I | || | || | | | | | | 



Db 


481 


r xiXjkjjjr x oHjIn J\i Dcii\j\XL^iijJ\JxMvX V i L^rUN 1 b i J\i bN Pr L VAAQDSETDYVTTDNLTKVTE 


540 


Ov 


S97 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 
1 M 1 M 1 1 1 M 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | | | | | | | | | | | | | | M | | | 

TPI f\ TT\ "NTTvA'D TP f T rpnrvT -i t-/-\ rp 7\ r^TP c TP T MTTT rrn /■"■» m vt 7\ vrmr/y /rr\ t t r^m /-it-it _ 

i v v an iyl p & CjL, r P D L VQ EAC E S E LN E VT GT K I AYETKMDLVQTSEVMQESLYPAAQLCPSF 


586 


Db 


O ± 


600 


Ov 


Sft 7 

JO/ 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 
1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 i 1 1 1 II 1 I 1 1 1 I | | | | | | | | | | | | | | | | | | | | | | || 
Ebbh^lPbPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 


Db 


DUX 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 
1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I | | | | | | | | | | | | | | | | | | | | || || | | | | || 
lV\MbVbXjKJ\VbGxKEEIKEPENINA^ 


705 


Db 


OOJ. 


720 


Ov 
Db 


706 

/ X 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 
1 1 1 1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 II I I I I I | | | | | | | | | || | | | | | | | | | || | || || | | | || | 
1YIAA v i^yp V P Uft b t liVEDb b P D S E PVDL FS DD S I P DVPQKQDETVMLVKESLTETSFESMI 


765 
780 




7 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 I I I || | | | | | | | | | | | | | | | 1 1 1 I | | | 

i^itNJ\EKLbALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 


Db 


7 ft 1 
/OX 


840 


Ov 


ft 9 £ 
o z o 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 

1 1 N 1 1 1 M II 1 1 I M 1 1 1 1 1 1 1 1 1 M I I | | | | | | || M | | || | | | || | | | | || 

bNUULt ISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 




ft 4 1 

O X 


900 


yy 




ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 
1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 II 1 1 M 1 1 1 1 1 1 M 1 M 1 1 1 1 
ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 


945 


Db 


901 


959 


Qy 


946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 
M 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I | || | | | | | | | | | | | | | : | I I I I | | 1 1 1 1 1 1 1 1 1 1 1 I 
TQAEIESIVT^PKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSVVDLLYWRDIKKTGV 


1005 


Db 


960 


1019 



Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I N I I I I I I I I I I I I I I | | | || | | || | | | | | | | M I I I I I I I I I I I I I M | | | | | | | | | 
Db 1020 VFGASLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVl^IWFTWGALFNGLT 1125 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M | | | | | | | | | | 

Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGlYANKNVKDAMAKIQAKIPGLKRKAx^ 1178 

I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I 
Db 1140 LLI LALI SLFSVPVI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 



RESULT 2 

US-09-972-599A-6 

; Sequence 6, Application US/09972599A 
; Patent No. US20020077295A1 
; GENERAL INFORMATION: 

; APPLICANT: STRITTMATTER, STEPHEN M. 

; TITLE OF INVENTION: NOGO-RECEPTOR-MEDIATED BLOCKADE OF AXONAL GROWTH 
; FILE REFERENCE: C077 CIP US 

; CURRENT APPLICATION NUMBER: US/09/972 , 599A 
; CURRENT FILING DATE: 2001-10-06 



PRIOR APPLICATION NUMBER: PCT/U5 01/0 104 1 

PRIOR FILING DATE: -2001-01-12 
; PRIOR APPLICATION NUMBER: 09/758,140 
; PRIOR FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: 60/236,378 
; PRIOR FILING DATE: 2000-09-29 
; PRIOR APPLICATION NUMBER: 60/207,366 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: 60/175,707 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 57 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 6 

LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-972-599A-6 



Query Match 98.2%; Score 5815; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 6.1e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 



d\7 

yy 


-1 
I 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 

1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M | | | | | | | | 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


UD 


1 


60 


Pit 7 


ol 


AGL S AAP VP TAPAAGAP LMD FGNDFVP PAP RG P L PAAP P VAP ERQPSWDPS P VS S T VP AP 

1 1 N 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I I M 1 1 1 1 1 II 1 1 1 M 1 I 1 1 
AGL S AAP VP TAPAAGAP LMDFGNDFVP PAP RG P L PAAP P VAP ERQ P S WD P S P VS S T VP AP 


120 


nu. 
UD 


bl 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 
1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | M 1 1 1 II 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 | | M 1 1 1 1 
SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Db 


121 


180 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 
1 1 1 : 1 : : 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I || I | | | | 
SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


226 


Db 


181 


240 


Qy 


227 


LS7VASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 
1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I I I I 1 1 I I | | M | | | | | M | | | | | | | | | || | 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 


Db 


241 


300 


Qy 


287 


GSS FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 

1 1 1 1 1 1 1 1 1 1 1 1 M 1 II M 1 1 1 1 1 1 II 1 1 1 | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | 1 1 

GSS FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 


346 


Db 


301 


360 


Qy 


347 


S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDS KEDSDMLAAGGKI ESNLESKV 

1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 II II 1 1 1 1 II | | || | | | | | 1 1 1 1 1 1 1 1 

S SEKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKDS KEDSDMLAAGGKI ESNLESKV 


406 


Db 


361 


420 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 
11 1 1 1 N 1 1 1 1 1 1 1 1 1 II | | | | | | | | | | | | | | | | | | | || || , | || || | , | 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 


466 


Db 


421 


480 


Qy 


467 


PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 
HI 1 M 1 1 1 1 II 1 I I I I | I | | M 1 II 1 1 1 1 1 | | | | | | | | | | | | | | | | | | | | | || | | | 

PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


526 


Db 


481 


540 



Qy 


527 


Db 


541 


Qy 


587 


Db 


601 


Qy 


647 


Db 


661 


Qy 


706 


Db 


721 


Qy 


766 


Db 


781 


Qy 


826 


Db 


841 


Qy 


886 


Db 


901 


Qy 


946 


Db 


960 


Qy 


1006 


Db 


1020 


Qy 


1066 


Db 


1080 


Qy 


1126 ] 


Db 


1140 ] 



EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | || | | | | | | | | | | | | | | | | || | || | U| | | | 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 
EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

IIIM| IIIIMIIMIIIIIIIIIMI|||||IMIIMIIINIIIIIIIIII||||| 



1 1 1 1 11 1 1 1 I I 1 1 I I I I I I I I I I I I I M I 1 1 I I I I | | | | | 1 1 



MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

1 11 1 1 1 I I I I I I I I I II I I I II I M I I I I I I I I I I I I I I I I I | | | | | | I I I I I | | | | | | | 

MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 
EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 84 0 
SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

1 1 1 1 11 I I I I I I M I I I I I I I M I I I I I I I I I | | || I I I M I I I I II I I I | | | | 

SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 
I I I I I I I I I I I I I I I I | | | | I I I M I I I I I I I I I | | I I I I I I I I M I I I M I 

ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 
I I I I I I I I I I I I I I I I II I | | | || | | | | | | | | | | | | | | : | | | | | I I I I I I I I I I I I I I 

TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 
VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

1 11 1 11 I I I I I I I I M I I I I I M I I I I I II M I I I I I I I I I I I I I I M I I I I I I I 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 
: 1 1 1 I I I I I I I I I I M I I I I II I I || | | | | | | | | | | | | | | | | | | | | 

EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 
I I I I I I I I I I I I | | | | | | | || | | || | | | | | || | | | | | | | | | 



RESULT 3 

US-10-060-036-71 

Sequence 71, Application US/10060036 
Publication No. US20030073144A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Benson, Darin R. 
Kalos, Michael D. 
Lodes, Michael J. 
Persing, David H. 
Hepler, William T. 
Jiang, Yuqiu 



; TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE THERAPY 
; TITLE OF INVENTION: AND DIAGNOSIS OF PANCREATIC CANCER 
; FILE REFERENCE: 210121.566 

; CURRENT APPLICATION NUMBER: US/10/060, 036 
; CURRENT FILING DATE: 2002-01-30 
; NUMBER OF SEQ ID NOS : 4560 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 71 

LENGTH: 1192 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-060-036-71 



Query Match 9 8.2%; Score 5815; DB 14; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 6.1e-284; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 
Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

, I M I I I I I I I I I I I | | M I I I I I I I I I I | | | | | | | M I I I I I I I I I I I I II I I I I 

Db 1 ^EDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

QY 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

liJliiL'iii 1 M I I I I I I M I I I I I I I I I M I I I I I M I I M I M 

PVSSTVPAP 120 



Db 61 A-GLS AAP VPT AP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP ERQ P S WD P S 



Q y 121 SPLSA ^AVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRi 



— ■ — — — — — 1 vim -vG 180 

1 1 1 1 1 1 1 I I I I M I I I I I I I I I | | | | | | I M I II I II I I I I I M I I I II I 

RG 180 



Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 



QY 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

IV'' l: : I I I I I I I I I I I I I I I I I II I I I I I I I I | | | | | | 

SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 



Db 181 



Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 
11 'II I | | | | | | | | | M | | | || | | | | | I I I I I I I I I | | | | 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 



Db 241 



Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 



_ VV 346 

'iiiliilJiJi'ilHllJ! 1 1 1 1 1 1 1 m 1 1 1 1 1 1 ii i ii 1 1 1 1 1 1 1 1 1 1 M i 

DEW 360 



Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKE 



QY 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 
1" 'I I I I I I N I M | | | | | | | | || | | | | || | | || | || || || , | | 



Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 



420 



QY 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I 1 1 I I I I I I I I I II I I I || | | | | | || | | | | || I | | M | | I I 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 



Db 421 



Qy 467 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

111 1 1 1 1 I 1 I I I N I I I I I I I I I I I I I I I I I | | | | | | || | | || | | | | | | | I | | | | I. 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I I I M I I I I I I I I I M M I I I I I I I I I I I I I | | | | | | | | | | | | | | | || | | | I | | I | || 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 



Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | || | | | | | | | | | | | | 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

Dh ««i lllllll 1 1 1 1 1 1 1 1 1 11 1 1 1 11 1 I I I I I I I I I I M I I II I I I I I I I | | | M | | M I I I 

Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

n „ 1 ' II I I I I I I I | I I | | | | | | | | | || || | | | | | || | | | || | | 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Q y 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

1 I I I I I I I I I I I I I I I I I I I I I I I | || | | | | | | | | || | 

Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 84 0 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 
11 1 1 I I I I I I I I I I I I II II | || | | | | | | | | || | | | | | | | | | || | | || | | | , | | , 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

nn M I I I I I I I I I I | | | | | | I I II I I I I I I I I I I | | | | | | | | | | | | | | | | , | || 

Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

acn UNI MM I UN Ml INI, III II MM III IM: I I I I I I I I I I I I I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I II I I I I I I I I | I I I I I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

lnQ ' 1 1 1 1 I I M I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 1080 EVAISEELVQKYSNSALGHYNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 



Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I M I I I I I I I I | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I | | | 
1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



Db 



RESULT 4 
US-09-789-386-2 

; Sequence 2, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 

; TITLE OF INVENTION: NOVEL COMPOUNDS 

; FILE REFERENCE: GP-30165-C1 

; CURRENT APPLICATION NUMBER: US/09/789,386 

; CURRENT FILING DATE: 2001-02-21 

; PRIOR APPLICATION NUMBER: U.K. 9916898.1 

; PRIOR FILING DATE: 1999-07-19 

; PRIOR APPLICATION NUMBER: U.K. 9816024.5 

; PRIOR FILING DATE: 1998-07-22 

; PRIOR APPLICATION NUMBER: US 09/359,208 



; PRIOR FILING DATE: 1999-07-22 
; NUMBER OF SEQ ID NOS : 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 1192 

TYPE: PRT 
; ORGANISM: HOMO SAPIENS 
US-09-789-386-2 



Query Match 98.1%; Score 5810; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. l.le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps ■ 

Q y 1 MEDLD QSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I | | | | | | | | | | | | | | I I I I I I I I I I I I I | | | | | | | | | M I I I ^ 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Db 1 



Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 
i I I I I I I I I I I I I I | | || | | | | | | | | || | | | | | | | || | | | || | 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 



QY 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

^ < I I I I I I I I I I I I I | | | M I I I I I I I I | | | | | | | | | | | M II I I I I I M | 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 



Db 121 



180 



Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

no " l|s 1= : I M M I M I I I I I I I I M I I I I I I I I I I I I I I | 

Db 18i SSGSVDETLF7UjPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 



Q y 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEPCAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I MINIMI I I I || | | || | | | | | | | | | | | | | || 

LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 



Db 241 



Q y 287 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVV 346 

M I M II II I I I I II M I I I I I I I I I I I | | || | | || | | | | || | | | | | | 

Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEVV 3 60 

QV 347 SSEKAKDSFNEKRVAVEAPMREEYAI)FKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 406 

HI I I II II I I I I I I | | | | | | II I I I I I M I I M I II I I I I 

Db 36i SSEKAKDSFNEKRVAVEAPMREEYAJDFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 
M I M I I I I II | || || || | || | || || | | || || | , | | | | | | | 

Db 42 1 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 



Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKWE 526 

I I I MM I II II II I I I I I I | | I I | | | | | M I I M I I I 

PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 



Db 481 



Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

I M II I I I I I I I M II I I | | | | | || | | | | | | | | | || | 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 
M I II II II I || || | || || | | | | | | | | | | | | | | || || | | || | , | | | | || || | , | 

Db 60i EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 



Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

111,111 M I I I I I I I I I I I | | | | | | | | | | | | | | M | I I I I I I I II I II M I I I I | I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

nH _ 91 ' ' ' 1 N I I M I I I I I I I I I I I I I II I I I I I I I I || | | || | | | | || | | | M | | | | | 

DJD /Z1 ^-^EQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

no , 1 M I I I I I I I I I I I I I I I I I I I I II I I I I | | || || | mm 

Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

QA , ''''I' INN || || | | | | | | | | | | | | || | | || | | | | , || | | | | 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

1 1 1 I I I I I I I I I M II I I I I I I M I I I II I I I I I I I I I I I I I I I I I I M | II I II I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 
QCn I I I N I I I I M I I I I II I I I I I || | || | | | | || : | | | | || , | | | | , | | | | | | | 

Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

i n o ri IN MMI I II MIIIIIIIIIIIMIIIIIIIIIIIIIMIIMIIII 

Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 

Qy 1066 EVAI S EELVQKYSNS ALGHVNCT I KELRRLFLVDDLVDS LKFAVLMWVFT YVGAL FNGLT 1125 

1non I I N I I I I I I I I I I I I I I | I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I 

Db 1080 E VAI S E EL VQKYSNSAXGHVNCT I KELRRLFLVDDLVDS LKFAVLMWVFT YVGAL FNGLT 1139 

Qy 1126 LLI LALI SLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I I I I N I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | ,|| M 

Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 5 

US-09-893-348-23 

Sequence 23, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EIS-SCHWARTZ=2A 

CURRENT APPLICATION NUMBER: US/ 09/ 8 93 , 34 8 

CURRENT FILING DATE: 2001-06-28 

PRIOR APPLICATION NUMBER: US 09/314,161 

PRIOR FILING DATE: 1999-05-19 

PRIOR APPLICATION NUMBER: US 09/218,277 

PRIOR FILING DATE: 1998-12-22 

PRIOR APPLICATION NUMBER: PCT/US98/14715 



; PRIOR FILING DATE: 1998-07-21 

; PRIOR APPLICATION NUMBER: IL 124500 

; PRIOR FILING DATE: 1998-05-19 

; NUMBER OF SEQ ID NOS : 2 9 

; SOFTWARE: Patent In version 3.1 

; SEQ ID NO 23 

LENGTH: 1192 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-893-348-23 



Query Match 98.1%; Score 5810; DB 9; Length 1192; 

Best Local Similarity 97.2%; Pred. No. l.le-283; 

Matches 1159; Conservative 4; Mismatches 14; Inciels 16; Gaps 



Qy 


l 


MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I | | | | | | | | | | | || | | | | | | | | | | || | | | | | | | | | | | | | | | 

MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 


60 


Db 


l 


60 


Qy 


61 


AG L S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP AP 

1 1 1 1 1 1 1 1 1 1 1 1 1 I || | | | || | | | | | | | | | M | || || | || | | | | | | 

AG L S AAP VP TAP AAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP AP 


120 


Db 


61 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 

1 1 N 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II | | | | | | | | 1 ! II 1 1 II 1 1 1 1 1 1 1 1 1 1 I M 

SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 


Db 


121 


180 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 
1 1 1 : 1 : : 1 1 M 1 1 II II II II II 1 1 1 1 || I || | || | || | | 
SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


226 


Db 


181 


240 


Qy 


227 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 

N 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 


Db 


241 


300 


Qy 


287 


GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEW 

1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 II 

GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 


346 


Db 


301 


360 


Qy 


347 


SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I | | | | || | | | || | | | | M 1 1 1 1 1 

SSEK7VKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 


406 


Db 


361 


420 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 
1 1 1 1 1 N 1 1 1 1 1 1 1 II 1 1 1 1 | | | M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I I | 1 1 1 1 1 1 
DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 


466 


Db 


421 


480 


Qy 


467 


PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 
Ml 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I I I I II 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


526 


Db 


481 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M II 1 1 1 II M 1 1 1 1 1 1 I | | | | | | | | | | | | | | | | || | | | | | | M 
EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


586 


Db 


541 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 



N I I I I I I II I I I I I I I I I I M I | | | | | | | | | | || | | | M I I I I I M I I I 



Db 


601 


j-jj-iij j_ur\.x iroirvijir-UJ. vi^iv-Viri-ilN o/\v ir o>vcr.rt.o v -L^-tr ooor XjiLAoo VIM liliol KnEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 
1 1 1 1 M 1 1 1 1 1 1 1 1 I I I I 1 1 1 1 1 1 I I I M I I I | | | | | | | | | M 1 1 1 1 1 1 1 || | | | | | | | 


705 


Db 


661 


720 


Qv 


706 


MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 
1 1 1 1 1 1 1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 I 1 1 1 I | 1 I 

ljrux v ^v-^ viri^noijijviLJJooirjJoiijrvjJijr ojjjjoX xriJVryj\yjJrjl VMLVKhiSLTETSFESMI 


765 


Db 


721 


780 


Qv 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 
1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I M 1 II 1 1 1 1 II 1 II | || | | | 

i^i ili^j n.rirvijo/^ijf fUjijUAir' ill HjOc J\lib IiJJJn 1 J\L>1 LLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


781 


840 


Ov 


826 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 

HI M 1 II 1 II 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 | | M 1 M 

oinujjijt J-^rv&iHyiKHjl iiii r bUbbrXhillDbt PTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


900 


Ov 
y 


886 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 

1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 II 

v tjAta b 1. 1? u 1 Uli FtL DLb L KN I QP KVEEKI S FS DD FS KNGS AT S KVLLLPPDVSALA- 


945 


Db 


901 


959 


Qy 


946 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 
1 1 1 N 1 1 1 1 1 M M 1 M 1 1 1 1 1 1 1 1 1 1 1 I I I | | | | | | | : | II 1 1 1 II 1 1 1 1 I I I | | M 
j. v i\rTy.vij v J\ri/vrjJ\rviji-' bUl £,J\ilDKo FbAI r SAELSKTSWDLLYWRDIKKTGV 


1005 


Db 


960 


1019 


Ov 
Db 


1006 
1020 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 1 1 1 1 1 1 1 I I I | | | 1 1 1 1 1 1 1 1 1 1 I I I I I I | | | | | | | | | | | 

vr Liijij^iji vr bl VbvlAYlALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 
1079 


Ov 
y 


1066 


EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 

1 1 1 1 MUM M 1 | | | | | | | | | | | | | | | | | | | || | | | | M| 

EVAI SEE L VQ K Y S N SAL GHVN C T I KE L RRL FL VD D LVD S L K FAVLMWVFT YVGAL FN GLT 


1125 


Db 


1080 


1139 


Qy 


1126 


LLI IALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 1 1 1 1 | || | 

LLI LALI SLFSVPVI YERHQAQI DHYLGIANKNVKDAMAKIQAKI PGLKRKM 1192 




Db 


1140 





RESULT 6 

US-10-267-502-429 

; Sequence 429, Application US/10267502 

; Publication No. US2004007 1700A1 

; GENERAL INFORMATION: 

; APPLICANT: Kim, Jaeseob 

; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 

FILE REFERENCE: LSD-07416 
; CURRENT APPLICATION NUMBER: US/10/267, 502 
; CURRENT FILING DATE: 2003-01-27 
; NUMBER OF SEQ ID NOS : 439 
; SOFTWARE: Patentln version 3.2 
; SEQ ID NO 429 
LENGTH: 1192 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-267-502-429 



Query Match 98.1%; Score 5810; DB 12; Length 1192; 

Best Local Similarity 97.2%; Pred. No. l.le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

Q y 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

„ I I I I I I I I I I I I I I I I II II I II I I I I I I I I I I | M I I I I I I I I I | | | | M I I I I 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

1 1 I N I I M I I I I I I I I I I I I II I I I I I I II I I I | | | | || | | || | | | | | | || || 

Db 61 AGL S AAP VP T AP AAGAP LMD FGND EVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP AP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

1 1 I I I I I I I I I I II I I I I I I II I I I I I I M I I I I I I I II I I I I I I I I I I I | | | M | | | I I 
Db 121 SPLSA AAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 22 6 

io I 1 1 : I : : I I I I I I I I I I I II I I I I I I I I I I | | | | | | | | | | 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I i i i i i i i i i i i i i || i i i i 

Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 



Q y 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I I I M I I I | | | I I | | | I I I I I I I II I || I | | | | | || | | | || 

GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 



Db 3 01 



Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 406 

UiiliiliJiilJJJi 1 I 1 1 1 I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ESNLESKV 42 0 



Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKI 



Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I II I I I I I I I I I I I | | | | | | | | | | | | | | | | | 

DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 



Db 421 



Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 526 

HI I I I I I I I I I I I I I I I I II I I I I I I III | | | | | | | | | || | | 

PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 



Db 4 81 

Qy 527 
Db 541 



EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 
I I I I M I I II I | | | | | | || | | | | | | | | | | | | | | || | | || | , | || | | , , | | 

EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 



QY 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

Mill I I I I M I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | || 

Db 601 EESEAT PSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

r I N I I I I I I I II I I I I I I I I I I I I I | | | | | | | | | | | I II I I I I I I I I I I I I I I 

Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

n ^ 11 I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I | M I I I I I I I I I 

Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 



Ov 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 

1 I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | M M 1 1 1 1 1 | | | | | | | M 1 1 1 1 1 1 1 

Hi i ^JM KhKLbALPPhGGKPYLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


825 


Db 


7 R 1 


840 


Ov 


826 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 

1 1 1 1 1 1 1 II 1 II 1 1 1 1 II 1 M | I I | M 1 1 1 1 I I I | | | I I | I I I | | | | | | 

an Duke -LoKtiAyiKLlETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 


Db 


841 


900 


Ov 


ft ft £ 

o o u 


ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 
N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | M II 1 1 1 1 I I I I | | | | | | | | | | M 1 

AJNAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVXLLPPDVSALA- 


945 


Db 


_7 U _L 


959 


Ov 


J T U 


TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 
N M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I || | | | || | | : | | | | | | | || | | | | || | || | 

l _^1--^1VKFKVLVK_AEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 


1005 


Db 


960 


1019 


Ov 


XUuO 


VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 
1 1 1 N 1 1 1 1 1 1 1 1 1 1 I I | | | | | | | | | | | | | | | | | | | | | | | | | || | || | | | , , | | , 

VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 


1065 


Db 


i n 9 n 
± u _i u 


1079 


Ov 

sly 


_L U O O 


EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLIW\/FTYVGALFNGLT 

M 1 1 I II 1 1 II II 1 1 1 II 1 1 1 M 1 1 II 1 1 II 1 1 | | | | | | | | | | | | | || | | | || 

EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 


1125 


Db 


1080 


1139 


Qy 


1126 


LLILALISLFSVPVIYERHQAQIDHYLGI n 78 

N M 1 1 M II 1 1 1 1 1 1 II I I | || | || | | | | | | || | | | | | | | | | | || | | 

LLI LAL I S LFS VPVI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1192 




Db 


1140 





RESULT 7 
US-10-327-213-9 

; Sequence 9, Application US/10327213 

; Publication No. US20040121341A1 

; GENERAL INFORMATION: 

; APPLICANT : FILBIN, MARIE T. 

; APPLICANT: DOMENICONI, MARCO 

; APPLICANT: CAO, ZIXUAN 

; TITLE OF INVENTION: INHIBITORS OF MYELIN-ASSOCIATED GLYCOPROTEIN (MAG) 

; TITLE OF INVENTION: ACTIVITY FOR REGULATING NEURAL GROWTH AND REGENERATION 

FILE REFERENCE: CUNY/ 003 
; CURRENT APPLICATION NUMBER: US/10/327 , 213 
; CURRENT FILING DATE: 2002-12-20 
; NUMBER OF SEQ ID NOS : 4 3 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 9 

LENGTH: 1192 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-327-213-9 

Query Match 98.1%; Score 5810; DB 16; Length 1192; 

Best Local Similarity 97.2%; Pred. No. l.le-283; 

Matches 1159; Conservative 4; Mismatches 14; Indels 16; Gaps 3; 

1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

' ' ' I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I I I I I I 

Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Qy 


61 


AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 1 




Db 


61 


AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 


120 


Qy 


121 


SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 


180 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 | I | 




Db 


121 


SPL S AAAVS PSKLPEDDEP P ARP P P P P P AS VS P QAE P VWT P P AP AP AAP P S T PAAP KRRG 


180 


Qy 


181 


SSGA WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 


226 


Db 


181 


llh 1 : : 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 


240 


Qy 


227 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


286 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 II I | | | | | | | | | | | | | || | I | | | | | | | 




Db 


241 


LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 


300 


Qy 


287 


GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 


346 






1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


301 


GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 


360 


Qy 


347 


SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 


406 






1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I 




Db 


361 


SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 


420 


Qy 


407 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 


466 






1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | | | | | | | | | | | | | | | || | | | | | || | | | | | 




Db 


421 


DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 


480 


Qy 


467 


PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 


526 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 I I I I I | | | 1 1 1 1 1 1 1 1 1 I I I I M 1 1 1 1 1 1 




Db 


481 


PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 


540 


Qy 


527 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


586 






M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 I 1 1 II 1 1 1 1 I M I I I I I I I I | | || | | | | | | | | | | | | | | | || 




Db 


541 


EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 


600 


Qy 


587 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


646 






1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 || 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 | | | I | | 




Db 


601 


EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 


660 


Qy 


647 


EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


705 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I 




Db 


661 


EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 


720 


Qy 


706 


MAKVEQPVPDHSELVEDS S PDS EPVDLFS DDS I PDVPQKQDETVMLVKESLTETS FESMI 


765 






1 1 M 1 II 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II I I 1 1 I I I I I I 1 I I 




Db 


721 


MAKVEQPVPDHSELVEDS SPDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETS FESMI 


780 


Qy 


766 


EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 


825 






1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | | | | || | | | | | M 1 1 II 1 1 1 II 1 1 1 1 1 1 




Db 


781 


EYENKEKLSALPPEGGKPYLES FKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 


840 


Qy 


826 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


885 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 




Db 


841 


SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 


900 



Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I | I M 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

QY 94 6 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I || | | M I II M I I II I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I M I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 102 0 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI QAIQKSDEGHPFRAYLES 107 9 

Qy 10 66 EVAISEELVQKYSNS7VLGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 

M I I I I II I I I I I I I I I I I I I I I I II I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | || M 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVTiSLKFAVLMWFTYVGALFNGLT 1139 

Qy 1126 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKM 1178 

M I M I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 8 
US-10-466-258-9 

Sequence 9, Application US/10466258 
Publication No. US20040132096A1 
GENERAL INFORMATION: 
APPLICANT: GLAXO GROUP LIMITED 
TITLE OF INVENTION: ASSAY 
FILE REFERENCE: P80966 GCW 

CURRENT APPLICATION NUMBER: US/10/4 66, 258 
CURRENT FILING DATE: 2003-07-15 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 9 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-466-258-9 



Query Match 98.1%; 
Best Local Similarity 97.2%; 
Matches 1159; Conservative 



Score 5810; DB 16 
Pred. No. l.le-283 
4; Mismatches 14 



Length 1192; 
Indels 16; Gaps 



3; 



Qy 



Db 



1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I M I I I I II I II I I I I I I I I I II I I I I I II I I I I I I I I I II I I I I I I I 
1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Qy 

Db 

Qy 

Db 

Qy 



61 AGL S AAP VPT APAAGAP LMD FGN D FVP PAP RGP L PAAP P VAP E RQ P S WD P S P VS S T VP AP 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I II I I I I I I I I I 1 || I I I I I I I I 

61 AGL S AAP VP T APAAGAP LMD FGND FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VPAP 120 

121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

M I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I I I I 

121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



181 



SSGA- 
III: 



-WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 
I : : I M I I I I I I I I I I I I I I I I I II I I I I I I Ml: 



Db 181 S S GSVDETLFALPAASEPVI RS SAENMDLKEQPGNTI SAGQEDFP SVLLETAASLPSLS P 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAESAVIVANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLA/KEDEW 346 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I 
Db 301 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHNQQELPTALTKLVKEDEW 360 

Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKV 4 06 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

I M I I I I M I I I I I I I I I I II I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 540 

Qy 527 EW7\NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I M I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I II I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 781 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 840 

Qy 82 6 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 8 85 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I 
Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 8 86 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy ' 946 TQAEIESIWPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I : I I I I I I || I I I | I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAI FSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I II I I I I I I II I I I I I I I I I I | I I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1079 



Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1125 

I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 

Qy 1126 LLI LALI SLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 

I I II I I I I I I II I I I I I I I i I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I 
Db 1140 LLI LALI SLFSVPVI YERHQAQ I DHYLGLANKNVKDAMAKIQAK I PGLKRKAE 1192 

RESULT 9 
US-10-408-967-7 

Sequence 7, Application US/10408967 
Publication No. US2 004 0063161A1 
GENERAL INFORMATION: 
APPLICANT: Pharmacia & Upjohn Company 
APPLICANT: Yan, Riqiang 
APPLICANT: Lu, Yifeng 

TITLE OF INVENTION: Compositions and Methods of Treating Alzheimer f s Disease 
FILE REFERENCE: 00925 

CURRENT APPLICATION NUMBER: US/10/408 , 967 
CURRENT FILING DATE: 2003-04-08 
NUMBER OF SEQ ID NOS : 9 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 7 
LENGTH: 1192 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-408-967-7 

Query Match 97.8%; Score 5794; DB 12; Length 1192; 

Best Local Similarity 97.0%; Pred. No. 6.9e-283; 

Matches 1157; Conservative 4; Mismatches 16; Indels 16; Gaps 3; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VP T AP AAGAP LMD FGND FVP PAP RGP L P AAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 61 AGL S AAP VP TAP AAGAP LMD FGND FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSP 226 

III: I : : II I II M I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 181 S SGSVDETLFALPAAS EPVI RS SAENMDLKEQPGNTI SAGQEDFPS VLLETAASLP SLS P 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 28 6 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I II I I I I I 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEW 346 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I 

Db 301 GS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 



Qy 347 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 406 

I I I I I I I I I I I I I I I I I I I ! I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 361 SSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDSDMLAAGGKIESNLESKV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I I I II I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRPGAYITCAPFNPAATESIATNIF 480 

Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

III I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 58 6 

I I II I M I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 64 6 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 

Qy 647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 EAMSVSLKKVSGIKEEIKEPENINAALQETEAPY1SIACDLIKETKLSAEPAPDFSDYSE 720 

Qy 706 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 765 

I I I I I I I I I I I I I I I I II I I I I I I I M I I I I I I I I I I II I I I I I I I I M I I I II I I II I I 
Db 721 MAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMI 780 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 EYENKEKLSALPPEGGKPYLES FKLSLINTKDTLLPDEVSTLSKKEKI PLQMEELSTAVY 840 

Qy 826 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 885 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I II I I I I I I || 

Db 841 SNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEI 900 

Qy 886 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALA- 959 

Qy 946 TQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGV 1005 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I 
Db 960 TQAEIESIVKPKVLVKEAEKKLPSDTEKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGV 1019 

Qy 1006 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLES 1065 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I M I I I Mill 
Db 1020 VFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFPAYLES 107 9 

Qy 1066 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVG7VLFNGLT 1125 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1080 EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFTYVGALFNGLT 1139 

Qy 1126 LLILALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKIQAKI PGLKRKAE 117 8 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 1140 LLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1192 



RESULT 10 
US-09-893-348-18 

Sequence 18, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SENBACH- SCHWARTZ , Michal 
APPLICANT: COHEN, I run R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2 A 
CURRENT APPLICATION NUMBER: US/ 09/ 8 93 , 34 8 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 . 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/ 14 7 15 
PRIOR FILING DATE: 1998-07-21 
PRIOR APPLICATION NUMBER: IL 124500 
PRIOR FILING DATE: 1998-05-19 
NUMBER OF SEQ ID NOS : 29 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 18 
LENGTH: 1163 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-893-348-18 

Query Match 72.5%; Score 4296.5; DB 9; Length 1163; 

Best Local Similarity 74.0%; Pred. No. 1.3e-207; 

Matches 885; Conservative 104; Mismatches 156; Indels 51; Gaps 19 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

111:111 I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I II : I I : : I I I I I I I I I I I I I 
Db 1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

Qy 59 P AAGL S AAP VPT AP AAGAP LMD FGND FVP PAP RG P L P AAP P VAP E RQ P S WD P S P VS S T VP 118 

I I I I I I I I II III I I I : I I : I II II I I II I I I I I I I I I I I I I I : II : I 
Db 61 PAAGLSAAAVP — PAAAAPLLDFS SDSVP PAPRGPLPAAPPAAPERQP SWERS PAA P 115 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 17 8 

III I I I I I I I I II I I I I I I I I I I I I II II I I M I I I I I I I I I I I 

Db 116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE P AAP P S T P AAP K R 166 

Qy 179 RGSSG AWXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSL 224 

III : I I I I I I I I I I I : I : I 1 I M I I I I I II I I I I I I 

Db 167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

Qy 225 SPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 

I I I I I I I I I I I I I I I I : I I I : : I : : I I I I I : I : I : : II I I I I I I I I I 

Db 227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFSELEYS 286 



QY 



285 EMGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 



Db 287 EMGS S FKGS PKGESAI LVENTKEEVI VRSKDKED-LVCSAALHS PQES PVG KEDR 340 

Qy 345 WSSEKAKDSFNEKRVAVEAPMREEY7U3FKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 403 

I I I I I I I I I ::: I II : I I I I I II I I I I : I I I II : I I I : I I I : I : I 

Db 341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

Qy 4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

Nihil INN: I I I I I : I I I I I I I I : I I I I I I I I I I : t I I I 
Db 397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCAS FT-SATESTTA 455 

Qy 464 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

I I I I I I I I I I I I I I I I I I I : I I I I : I II II I I I I I I II I I I I I I I I I I I : I 
Db 456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 

Qy 524 VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLC 583 

III I : I I II I I I I I I I II I I I I I I I I I I I II I I I I I : I I I I I I I : I I I I I I I I I I 
Db 515 VTET^AVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 574 

Qy 584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 

I II I I : I I I I I M I I I I I I I I I I I I : I I I I I I I : I I I I I I I I I : I : I I I I I I I I 
Db 575 PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

Qy 643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 7 02 

I I I I I I I : I : I I I II INN: I I I : I I I I I I I I I II I I I I I I I I I I I I : I I I I : 
Db 635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

Qy 703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

llhll I : M : I : II I I I I I I : I I I I I I I I I I I I I : I I I I : I I I I : I I I I M I I 
Db 695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

Qy 763 SMIEYENKEKLSAIiPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

: : : : : : I : I I I I I I I I I II I I : : I : I I I : : : I I : I I I I I I II I I : I 

Db 754 TVAQHK-EERLSAS PQELGKPYLES FQPNLHSTKDA-ASNDI PTLTKKEKI SLQMEEFNT 811 

Qy 823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

I : I I I I I I III : I : I : I I I I I I I I I I I I I I I I I : I : I II I II : I I II I I I I I 
Db 812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

Qy 883 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

I I I I I I I II I I I II I I I I I I I I : : : I I : I I : I I : II : I : I I I 
Db 871 SEIANIQSGADSLPCLELPCDLSFKNIYPK— DEVHVSDEFSENRSSVSKASISPSNVSA 928 

Qy 943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

I I I : I I I I I I I I I I I I I II I I I I I I I I II: I I : I I I I I I I I I I I I I I I I 
Db 929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

Qy 1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

I I I I I I I I I I I I I I I I II I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1048 LESEVAISEELVQKYSNS7VLGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLIIJUISLFSVPVIYERHQAQIDHYLGI^KNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I : | | | | I | | | I I II I I I I | I I : 



Db 



1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQ7VKIPGLKRKAD 1163 



RESULT 11 
US-10-267-502-431 

; Sequence 431, Application US/10267502 
; Publication No. US2 004 0071700A1 
; GENERAL INFORMATION: 

APPLICANT: Kim, Jaeseob 
; APPLICANT: Galant, Ron 

; TITLE OF INVENTION: Obesity Linked Genes 
; FILE REFERENCE: LSD-07416 

; CURRENT APPLICATION NUMBER: US/ 1 0/2 67 , 502 
; CURRENT FILING DATE: 2003-01-27 
; NUMBER OF SEQ ID NOS : 439 

SOFTWARE: Patentln version 3.2 
; SEQ ID NO 431 

LENGTH: 1163 
; TYPE: PRT 

; ORGANISM: Mus mus cuius 
US-10-267-502-431 

Query Match 72.2%; Score 4277.5; DB 12; Length 1163; 

Best Local Similarity 73.8%; Pred. No. 1.2e-206; 

Matches 883; Conservative 108; Mismatches 154; Indels 51; Gaps 19 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111:111 I I I I I Mill! I I I I I I I I I I I ! I I : I I : I I II :: I I I I M I I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L P AAP P VAP ERQPSWDPS P VS S T VP A 119 

I I I I I I I I I III llhll : I I I I I I I I I I I I I I I I I I II I I I : II : I I 

Db 61 AAGLSAVPVP — P AA- AP LLDFSSDS VP PAP RG P L P AAP P TAP E RQ P S W E R S P AAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II I I I I I I I I I I I II I I I I I I I I I I II I I I I I I M I I I I I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

II : I I I I I I I I I I I I : I : I I I II I I I I I I I I I I I I I 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSE 2 85 



Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNE^SRELPERATNPFVNRESAEFSVLEYSE 283 

Qy 286 MGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

Mill: III I II : : I I =ll = tt = =fl = l= II : II II I I II : II I I I 
Db 284 MGS S FNGS PKGESAMLVENTKEEVI VRSKDKED-LVCSAALHNPQES PATLTKWKEDGV 342 

Qy 346 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 404 

MM I I I I : : M II M M I I II I I I I : II II h I I I M I I M M I 

Db 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 398 

Qy 405 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 464 

I I II I I I I II I I : I M I II I :: I I I M II : II I I II II I : I I I I I I 
Db 399 KVDKKCFEDSLEQKSHGKDSESRNENASFPSTPELVKDGSRAYITCDSFT-SATESTAAN 457 



Qy 4 65 IFPLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

111:111 I I II I I I I I I I I : I I I I : I I I II I I I I I I I I III I II I I I I I : I I 
Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

Qy 525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II III I I I I I I I I I I I I I II I I I I I I I I I I I I II I : II I I I II : I I I : II I I I I I 

Db 517 TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 57 6 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I I I I : I I I I II I I I I I I I I I I I I I : I I I I I I 111:1111 I I : I : II I I I I I I 
Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 7 03 

I I I I I I I I : I I I I I I I I I I I : III II I I I II I I I I I I I I I I I I I II : I I I : I 
Db 637 PYEEAMSVALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 763 

1:11 I : I I I I I I I : I I I I : I I I I I I I I I I I II : I II hi I I I : I I I I I I I I : 
Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 7 64 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

: : : : : I I : I M I I I I I I I I I I : : I III : I : I I : I I I I I I I I I : I I 

Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 883 

: I I I I II III ::: I : I I I I I I I I I II I I I I I I : I : I II : I I I I I I I I : I I 

Db 815 IYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKS 87 0 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

I I I I I I I I I I : I I I I I I I I III : I I : I I I : I : I I I I I I : I I I I 
Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL 928 

Qy 944 GHTQ7VEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKT 1003 

: I I : : I I I I I I I I I I I : I I I II I I I I I I I : I : I I : I I I I I I I I I I I M I I I I 
Db 929 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKT 987 

Qy 1004 GW-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 98 8 GVVYFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I M I I I I I I I I I I I I I I 
Db 1108 GLTLLILALI SLFS I PVI YERHQAQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAE 1163 

RESULT 12 
US-09-789-386-6 

; Sequence 6, Application US/09789386 

; Patent No. US20020010324A1 

; GENERAL INFORMATION: 

; APPLICANT: MICHALOVICH, DAVID 

; APPLICANT: PRINJHA, RABINDER KUMAR 



TITLE OF INVENTION: NOVEL COMPOUNDS 
FILE REFERENCE: GP-30165-C1 
CURRENT APPLICATION NUMBER: US/09/78 9, 38 6 
CURRENT FILING DATE: 2001-02-21 
PRIOR APPLICATION NUMBER: U.K. 9916898.1 
PRIOR FILING DATE: 1999-07-19 
PRIOR APPLICATION NUMBER: U.K. 9816024.5 
PRIOR FILING DATE: 1998-07-22 
PRIOR APPLICATION NUMBER: US 09/359,208 
PRIOR FILING DATE: 1999-07-22 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 6 
LENGTH: 373 
TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-789-386-6 

Query Match 25.2%; Score 1495.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 2.1e-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 8 05; Gaps 1; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I II I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPTAPAAGAP LMD FGND FVP PAP RGP L PAAP P VAP ERQ P S WD P S P VS S T VP AP 120 

I I I II I I I I I I I I i II I I I I I II I I I I I I I I I I I I I II I I I I I II M I II I I I I I I I I I I 
Db 61 AGL S AAP VP T AP AAGAP LMD FGN D FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I t I I I I I II I I I I I I I I I I I I I I I I I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 
I I I : I 

Db 181 SSGSV 185 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Db 186 185 

Qy 301 IVANPREEI IVKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS SEKAKDSFNEKRV 360 

Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATNIFPLLEDPTSENXTDE 4 80 

Db 186 185 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 

Db 186 185 



Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 186 185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 

Db 186 185 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 72 0 

Db 186 185 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

Db 186 185 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

Db 186 185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 186 185 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Db 186 185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGV1QAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I II I I 1 I I I I I I 

Db 216 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 275 

Qy 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

I I I 1 I I I I I I I I M I I I I I I I I I I I II II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 276 ALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 335 

Qy 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 117 8 

I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 336 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 37 3 



RESULT 13 
US-09-765-205-6 

; Sequence 6, Application US/09765205 
; Patent No. US20020034800A1 
; GENERAL INFORMATION: 
; APPLICANT: Cao, Li 

; TITLE OF INVENTION: BONE MARROW SECRETED PROTEINS AND POLYNUCLEOTIDES 

; FILE REFERENCE: 1458.004/200130.4 4 9 

; CURRENT APPLICATION NUMBER: US/ 09/765 , 205 

; CURRENT FILING DATE: 2001-01-17 

; PRIOR APPLICATION NUMBER: US/ 09/212,440 



; PRIOR FILING DATE: 1998-12-16 
; NUMBER OF SEQ ID NOS : 4 6 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 6 

LENGTH: 373 
; TYPE: PRT 

ORGANISM: human 
US-09-765-205-6 

Query Match 25.2%; Score 1495.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 2.1e-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 805; Gaps 1 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I i I I I I I I I I I I I I I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I 

Db 61 AGLSAAPVPTAPAAGAPLMDFGNDFVP PAPRGPLPAAP PVAPERQPSWDPS PVS STVPAP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I II I I I I I I I I I I I I I I I I I I I I 

Db 121 S P L S AAAVS P S KL P E DDE P P ARP P P P P P AS VS P QAE P VWT P P APAPAAP P S T P AAP KRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 



Db 181 SSGSV 185 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Db 186 185 

Qy 301 I VANPREEI IVT^NKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVVS SEKAKDS FNEKRV 360 

Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

Db 186 185 

Qy 481 KKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 

Db 186 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 186 185 

Qy 601 VMEAP LNSAVP SAGAS VI QP S S S PLEAS SVN YES I KHEPENP PP YEEAMSVSLKVS GI KE 660 

Db 186 185 



Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Db 186 185 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

D b 186 185 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

D b 186 185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 186 185 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Db 186 185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRI YKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

I I I I I I 1 I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 216 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 275 

Qy 1081 ALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 27 6 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG7VLFNGLTLLILALISLFSVPVI 335 

Qy 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 336 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 



RESULT 14 
US-09-893-348-24 

Sequence 24, Application US/09893348 
Patent No. US20020072493A1 
GENERAL INFORMATION: 
APPLICANT: EI SEN BACH -SCHWARTZ , Michal 
APPLICANT: COHEN, Irun R. 
APPLICANT: BESERMAN, Pierre 
APPLICANT: MOSONEGO, Alon 
APPLICANT: MOALEM, Gila 

TITLE OF INVENTION: ACTIVATED T-CELLS, NERVOUS SYSTEM-SPECIFIC ANTIGENS AND 
THEIR USES 

FILE REFERENCE: EI S- SCHWARTZ =2A 
CURRENT APPLICATION NUMBER: US/09/893 , 34 8 
CURRENT FILING DATE: 2001-06-28 
PRIOR APPLICATION NUMBER: US 09/314,161 
PRIOR FILING DATE: 1999-05-19 
PRIOR APPLICATION NUMBER: US 09/218,277 
PRIOR FILING DATE: 1998-12-22 
PRIOR APPLICATION NUMBER: PCT/US98/ 14 7 15 



PRIOR FILING DATE: 1998-07-21 
; PRIOR APPLICATION NUMBER: IL 124500 
; PRIOR FILING DATE: 1998-05-19 
; NUMBER OF SEQ ID NOS : 29 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 24 
; LENGTH: 373 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-893-348-24 

Query Match 25.2%; Score 1495.5; DB 9; Length 373; 

Best Local Similarity 31.6%; Pred. No. 2.1e-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 8 05; Gaps 1; 

QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

M I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | M | | | 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGL S AAP VPTAP AAGAP LMD FGND FVP PAP RGP L P AAP PVAP ERQ P S WD P S P VS S T VP AP 12 0 

M I I II I I II I I I I M I I I I I I I I I I I I | | | | | | | | M | | | | | | | || | | | | | | | | | | | | | 
Db 61 AGL S AAP VP TAP AAGAP LMD FGN D FVP PAP RG P L PAAP PVAP E RQ P S WD P S P VS S T VP AP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I N I I I I I I I I I I I || || I I I I | I | | | | | | | | | | | | | | | | | | | | | | | | | | | | 

Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 18 0 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 24 0 

I I I : I 

Db 181 SSGSV 185 

Qy 241 LSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 



Db 186 185 

Qy 301 I VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEWS S EKAKDS FNEKRV 360 

Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 186 — 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

Db 186 185 

Qy 4 81 KKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPDL 540 

Db 186 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 186 185 

Qy 601 VMEAPLNSAVP SAGAS VIQPSSS PLEAS SVN YES I KHEPENPPPYEEAMSVSLKVSGIKE 660 



Db 186 185 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Db 186 — 185 

Qy 721 EDSS PDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETS FESMI EYENKEKLSALPPEG 780 

Db 186 185 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

Db 186 185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 186 185 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

Db 186 185 

Qy 961 KEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I 
Db 216 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 275 

Qy 1081 ALGHWCTIKELRRLFLVDDLVDSLKFAVL3y[WFTYVGAIjFNGLTLLILALISLFSVPVI 1140 

I I M I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I | I | | | I I I I I I I I I I I 
Db 276 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 335 

Qy 1141 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I II Ml I I I I I I I I I I I I I I I I I I I I I M II I I I I I 
Db 336 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 



RESULT 15 
US-10-408-967-8 

; Sequence 8, Application US/10408967 
; Publication No. US2 004 0063 161 Al 
; GENERAL INFORMATION: 

; APPLICANT: Pharmacia & Upjohn Company 
; APPLICANT: Yan, Riqiang 
; APPLICANT: Lu, Yifeng 

; TITLE OF INVENTION: Compositions and Methods of Treating Alzheimer's Disease 
; FILE REFERENCE: 00925 

; CURRENT APPLICATION NUMBER: US/10/408,967 
; CURRENT FILING DATE: 2003-04-08 
; NUMBER OF SEQ ID NOS : 9 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH: 373 

TYPE: PRT 
; ORGANISM: Homo sapiens 



US-10-408-967-8 

Query Match 25.2%; Score 1495.5; DB 12; Length 373; 

Best Local Similarity 31.6%; Pred. No. 2-le-67; 

Matches 372; Conservative 1; Mismatches 0; Indels 805; Gaps 1; 

Qy 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I M I I I 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

Qy 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 61 AGL S AAP VP TAPAAGAPLMD FGN D FVP PAP RGP L P AAP P VAP E RQ P S WD P S P VS S T VP AP 120 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I | | | | | I I 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 



Db 181 SSGSV 185 

Qy 241 LSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAV 300 

Db 186 185 

Qy 301 IVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRV 360 

Db 186 185 

Qy 361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

Db 186 185 

Qy 421 EKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 

Db 186 185 

Qy 481 KKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 540 

Db 186 185 

Qy 541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDI 600 

Db 186 185 

Qy 601 VMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYEEAMSVSLKVSGIKE 660 

Db 186 185 

Qy 661 EIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELV 720 

Db 186 185 

Qy 721 EDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

Db 186 185 



Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

Db' 186 185 

Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

Db 186 185 

Qy 901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ7VEIESIVKPKVLE 960 

Db 186 185 

Qy 961 KE7VEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVF 102 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 186 WDLLYWRDIKKTGWFGASLFLLLSLTVF 215 

Qy 1021 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 1080 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I M I I I I I I I I I I I I I I 1 I I I I I I I II I I I I I I 
Db 216 SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 275 

Qy 1081 ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 1140 

I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 27 6 ALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVG7VLFNGLTLLILALISLFSVPVI 335 

Qy 1141 YERHQAQI DH YLGLANKNVKDAMAKI QAKI P GLKRKAE 1178 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 336 YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 373 



Search completed: September 29, 2004, 18:48:18 
Job time : 132.45 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



September 29, 2004, 18:06:43 ; Search time 92.6748 Seconds 

(without alignments) 
4010.587 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



US-09-830-972-29 
5923 

1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 1178 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 



1017041 seqs, 315518202 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1017041 



Database 



SPTREMBL 25:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 



sp_archea : * 
sp__bacteria : * 
sp_fungi : * 
sp__human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp_organelle : * 
sp_phage : * 

sp_plant : * 

sp_rodent : * 

sp_virus : * 

sp vertebrate:* 

sp_unclassif ied: * 

sp_rvirus : * 

sp_bacteriap : * 

sp_archeap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 



1 

X 


4 ft S1 


81 . 


9 


986 


4 


Q8IUA4 


Q8iua4 homo sapien 


9 


4289 


72 . 


4 


1162 


11 


Q8BGM9 


Q8bgm9 mus musculu 


3 


4 9 77 S 


72 . 


2 


1163 


11 


Q8K3G8 


Q8k3g8 mus musculu 


A 




61 . 


8 


1046 


11 


Q8BGK7 


Q8bgk7 mus musculu 


D 


9 sft n s 

Z, JOU • 


43 . 




■ 720 


11 


Q7TNB7 


Q7tnb7 mus musculu 


o 


2527 


42 . 


7 


639 


11 


Q8K290 


Q8k2 90 mus musculu 


/ 


99 


37 . 


7 


578 


11 


Q80W95 


Q80w95 mus musculu 


ft 
o 


'1 SSI 

_L *J -J X 


25 . 


8 


392 


4 


Q96B16 


Q96bl6 homo sapien 


Q 


i 9 m s 

XZ U X ■ -J 


20 . 


3 


375 


11 


Q8BHF5 


Q8bhf5 mus musculu 


X U 


X X W-J 


19 . 


6 


356 


11 


Q8BH78 


Q8bh78 mus musculu 


1 1 
1 X 


1 1 S9 S 

X X OZ - J 


19 . 


5 


357 


11 


Q8K3G7 


Q8k3g7 mus musculu 


1 9 




15 . 


2 


184 




Q7YRW9 


Q7yrw9 bos taurus 




887 


15 . 


0 


199 


13 


Q7T224 


Q7t224 gallus gall 


X % 


ft 67 


14 . 


5 


17 9 


6 


Q9GM33 


Q9gm33 macaca fasc 


1 s 
X D 


7 ft ft S 

/ OO ■ J 


X O . 


3 


760 


13 


Q90638 


Q90638 gallus gall 


X D 


77 Q 


13 . 


2 


780 


11 


Q8K4S4 


Q8k4s4 mus musculu 


1 7 
X / 


77 ft 


13 . 


1 


780 


11 


Q8K0T0 


Q8kOtO mus musculu 


1 ft 
X O 


7 S S S 


12 . 


8 


214 


13 


Q7T222 


Q7t222 carassius a 


1 Q 
17 


D J7 _? 


XX. 


ft 


643 


11 


08CCU2 


Q8ccu2 mus musculu 


Z U 


£;ft ft 
DO 0 


1 1 

XX. 


D 


1 QQ 

X Zf z> 


4 


Sd U <-> -s 


Q9bq59 homo sapien 


9 1 

Z 1 


£79 
o / z 


1 1 

XX. 


O 


208 


13 


090637 


Q90637 gallus gall 


9 9 
Z Z 


£7 1 
D / X 


1 1 

XX. 




9 67 


1 1 

X X 


063765 


Q63765 rattus sp. 


9 ^ 
Z o 


OZ D 


1 n 

X \J . 


u 


237 


11 


08C6D5 


Q8c6d5 mus musculu 


9 4 
Z H 


£9 S S 
OZo ■ O 


1 n 

X U . 


S 


236 


11 


O8VBU0 


Q8vbu0 rattus norv 


9 R 
Z D 


Sft £ 


Q 
-7 ■ 


Q 


991 

X 


13 


07ZUD6 


Q7zud6 brachydanio 


9 ^ 
Z D 


S££i 


Q 


, D 


O _7 «J 


5 




Q9vmv9 drosophila 


9 7 
Z / 


S9 ft 


8 . 


9 


234 


5 


Q9VMW3 


Q9vmw3 drosophila 


9 ft 
Z O 


S9 ft 


ft 


Q 


2484 


5 


Q9U347 


Q9u347 caenorhabdi 


9 Q 
Z " 


526 


8 , 


. 9 


224 


5 


Q9VMW1 


Q9vmwl drosophila 


o u 


oz u 


Q 

0 1 


p 

, 0 


222 


5 


09VMW4 


Q9vmw4 drosophila 


o X 




0 

0 • 


Q 

, O 


9 fif)7 


5 


Q23187 


Q2 3 1 8 7 caeno r habdi 


Q 9 

o z 


SI ft 


Q 

O « 


7 


202 


5 


09VMW2 


Q9vmw2 drosophila 




O O O ■ ■u 


6 . 


( 1 


1417 


3 


Q871Y7 


Q871y7 neurospora 




^ S7 S 


a 

. vJ 1 


n 


2768 


5 


O 9 VC 0 0 


Q9vc00 drosophila 


^ S 


S4 S 

O fi O 




ft 


107 


13 


Q7T223 


Q7t223 carassius a 


o D 


S4 1 S 
oil . J 




ft 


5412 


5 


09W596 


Q9w596 drosophila 


^7 


^4 n s 




7 


999 


5 


023188 


Q2 3 1 8 8 caenorhabdi 


^ ft 


O O -3 




_ 7 


1150 


5 


O8IMM6 


Q8imm6 drosophila 




SS4 




• u 


X U X ~* 


5 


09MFS3 


Q9nfs3 drosophila 


4 o 


334 


5 , 


. 6 


18074 


5 


Q9I7U4 


Q9i7u4 drosophila 


41 


331 


5, 


.6 


7962 


4 


Q10465 


Q10465 homo sapien 


42 


331 


5 


.6 


34350 


4 


Q8WZ42 


Q8wz42 homo sapien 


43 


325.5 


5 


.5 


5327 


5 


076891 


076891 drosophila 


44 


322.5 


5 


.4 


846 


5 


Q8MRP6 


Q8mrp6 drosophila 


45 


322.5 


5 


.4 


1109 


5 


Q9VAY4 


Q9vay4 drosophila 



/ALIGNMENTS 



RESULT 1 
Q8IUA4 

ID Q8IUA4 PRELIMINARY; PRT; 986 AA. 

AC Q8IUA4; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23 f Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 



DE 

GN 

OS 

OC 

OC 

OX 

RN 

RP 

RA 

RT 

RT 

RL 

RN 

RP 

RA 

RL 

RN 

RP 

RA 

RL 

RN 

RP 

RC 

RX 

RA 

RT 

RT 

RL 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

SQ 



RNT4 (RTN4) . 
RTN4. 

Homo sapiens (Human) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
NCBI_TaxID=9606; 
[1] 

SEQUENCE FROM N.A. 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 

Structures of Human and Mouse Nogo/Rtn-4."; 

Submitted (OCT-2002) to the EMBL/GenBank/DDB J databases. 

[2] 

SEQUENCE FROM N.A. 
Oertle T., Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/GenBank/DDB J databases. 
[3] 

SEQUENCE FROM N.A. 
Van der Putten H.; 

Submitted (MAY-2002) to the EMBL/GenBank/DDB J databases. 
[4] 

SEQUENCE FROM N.A. 
TISSUE=Testis; 

MEDLINE=22376540; PubMed= 12488097; 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 
Structures of Human and Mouse Nogo/Rtn-4."; 
J. Mol. Biol. 325:299-323(2003). 
EMBL; AY102285; AAM64244.1; 
EMBL; AY123245; AAM64249.1; 
EMBL; AY123246; AAM64250.1; 
EMBL; AY123247; AAM64251.1; 
EMBL; AY123248; AAM64252.1; 
EMBL; AY123249; AAM64253.1; 
EMBL; AY123250; AAM64254.1; 

GO; GO: 0005783; C : endoplasmic reticulum; IEA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

SEQUENCE 986 AA; 108449 MW; 0CDE8 F64 7 0364 15A CRC64; 



Query Match 81.9%; 
Best Local Similarity 98.9%; 
Matches 97 6; Conservative 



Score 4851; DB 4; Length 986; 
Pred. No. 2.4e-231; 
1; Mismatches 8; Indels 2; 



Gaps 



Qy 

Db 

Qy 

Db 

Qy 

Db 



193 MDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 252 

I 1 I I I I I M I I I M I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
1 MDLKEQPGNTISAGQEDFPSVLLETAASLPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQ 60 

253 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 312 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I 
61 ENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVK 120 

313 NKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYAD 372 

I | | I I I I I M I II I I I I I II I II I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I 
121 NKDEEEKLVSNNI LHNQQELPTALTKLVKEDE VVS SEKAKDS FNEKRVAVEAPMREEYAD 180 



Qy 


373 


Db 


181 


Qy 


433 


Db 


241 


Qy 


493 


Db 


301 


Qy 


553 


Db 


361 


Qy 


613 


Db 


421 


Qy 


672 


Db 


481 


Qy 


732 


Db 


541 


Qy 


792 


Db 


601 


Qy 


852 


Db 


661 


Qy 


912 


Db 


721 


Qy 


972 


Db 


780 


Qy 


1032 


Db 


840 


Qy 


1092 


Db 


900 


Qy 


1152 


Db 


960 



FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 432 

| | | | M I M I I I I I I I I I I M I I I I I I I I I I I I I I I I M I II I 1 I I I I I I I 1 I I I I I M I 
FKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTS 24 0 

FPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVT 4 92 

I | t I I II II I I I I M I I I I I I II I I M I I I I I I I I M I I I I I I I II I I I I I I II II I I 
FPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLGDPTSENKTDEKKIEEKKAQIVT 300 

EKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDLVQEACESELNEV 552 
| | | | | | I | | | I I I I I I I I I I I II I I I I I II I I II I I I I I I II I I I I I I I I I I i I I I M I 



TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPS 612 

|| I I I I I I I II I I II I I I I I I I M I I M II I I I I I I N II I I II I I I I I I I I I I I I I I H 
TGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPS 42 0 



I I I I I M I II II I I I I I I I I I I I I I I I I M I II I I I I I II I t I I I I I II I I I I I II I I I 



LQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVD 731 
I | | M | I I I M I I I I I I I I I I I I II I I II I I I I I I I I I I M I I I I I I I I I 



LFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLS 7 91 

I I I I I I I I I I I I I I I I I I I I I I I II I I I M I M I I I I I I I I I I I I II M I M I I I I I I I I 
LFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLS 600 

LDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE 851 

| | | | | || M I II I I I I I I I I I I I M I I M I I I I I I I II I I I I I I I M I I I I I I I I I M I I 
LDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE 660 

IIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQP 911 

I | | | I I I I || I I I M I I I I I I I I I I II I M I I II I I M II I M I I I I I I I I I I M I M M 
IIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQP 72 0 

KVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVljEKEAEKKLPSDT 971 

I I I I I I I I I I I I I I I I I I I I I I MM II I I M M II II II I II II M II M 

K\^EKISFSDDFSKNGSATSKVLLLPPDVSALA-TQAEIESIVKPKVLVKEAEKKLPSDT 77 9 

EKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 1031 

I | | | | | I I I I I || : I I II II I I II II I II II I M M M II I I I II M I M I II 11 I II I 
EKEDRSPSAIFSAELSKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIAL 83 9 

aT.T.^/TT.qTrRTYKf^VTOATOKSDEGHPFRAYLESEVAISEELVOKYSNSALGHVNCTIKE 1091 

899 



I | | | | | || || I II I I I M I I I II I I II II II I II II I II II II I II I I I I M M M I I M 



LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHY 1151 

I I I I I M II II II II I II I II II II I I I M II I M I II I I I II I M I II II I M I 

LRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHY 959 

LGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

II I II I I II II I I II I II I II II II II 
LGLANKNVKDAMAKIQAKIPGLKRKAE 986 



AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
SQ 



PRELIMINARY; 

(TrEMBLrel. 
(TrEMBLrel. 
(TrEMBLrel. 



23, 
23, 
25, 



PRT; 1162 AA. 
Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Rodentia; Sciurognathi ; Muridae; Murinae; Mus , 



RESULT 2 
Q8BGM9 
ID Q8BGM9 
Q8BGM9; 
01-MAR-2003 
01-MAR-2003 
01-OCT-2003 
RTN4. 
RTN4. 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
Oertle T., van der Put ten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 
Structures of Human and Mouse Nogo/Rtn-4 . " ; 
Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 
[2] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
Oertle T., Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 



[3] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7; 
Van der Putten H.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
[4] 

SEQUENCE FROM N.A. 

STRAIN=129SvcJ7; 

Van der Putten. H., Mir A.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
EMBL; AY102284; AAM73506.1; -. 
EMBL; AY102286; AAM73511.1; -. 
MGD; MGI: 1915835; Rtn4 . 

GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
GO; GO: 0007399; P : neurogenesis ; IDA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

SEQUENCE 1162 AA; 126613 MW; 855697FBEE117 8 IF CRC64; 



Query Match 72.4%; Score 4289; DB 11; 

Best Local Similarity 73.9%; Pred. No. 1.7e-203; 
Matches 883; Conservative 109; Mismatches 153; 



Length 1162; 
Indels 50; 



Gaps 18; 



QY 
Db 



1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111:111 II I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I I :: I I I I I I I I II I I I I 
1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 



Qy 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

I I I I I II I I I 111 I I I : I I : I II I I I I I I II I I I I I I II I I I I : II : I I 
Db 61 AAGLSAAPVP — PAA-APLLDFS S DSVPPAPRGPLPAAPPTAPERQPSWERS PAAS A 114 



Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II I I I I M I I II I I I II II I I I I II I I I I M I I I II I II I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR--PPAPAGASPLAE — — PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

II : I M I I 1 I I I I I I : I : I I I I I I I I I I I I I I I I I I 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETT^ASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSE 285 

III I I I I I I I I I I I I II I I : : I : : I I I : I : I : I : : I : I I I II I I I 

Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 283 

Qy 286 MGS S FSVS PKAESAVI VANPREEI IVKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

Mill: III II I : : I I : I I : i I : : I I : I : II : II II I I I I : I I M I 

Db 284 MGSSFNGSPKGESAMLVENTKEEVIVRSKDKED-LVCST^ALHNPQESPATLTKWKEDGV 342 

Qy 346 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 404 

: I I I I I I I ::: I I I : I I I I I I I I I II : I I I I I : I I I : I I I : I : I I 

Db 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 398 

Qy 405 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 464 

I I I I I I I I I I I I I I I I I I I : : I I I I II : M I I II I I I : : I I I I I I 

Db 399 KVDKKCFEDSLEQKGHGKDSESRNENASFPRTPELVKDGSRAYITCDSFS-SATEST7VAN 457 

Qy 465 IFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

I I I : I II I I I I I I I I I I I I : I I II : I I I II I I I I II II I I I I I II I I I I : I I 

Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

Qy 525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II III I I I I I I I I I I I I I I I I I I I I I II I I I I II I : I I I I I I I : I I I : I I I I I I I 

Db 517 TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 576 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I I I I : I I I I I I I I I I I I I I I I I I I : I I I I I I I II : I I I I I I : I : I I I I I I I I 

Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 63 6 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 703 

I II I I I I I : I I I I I I I I I I I : I I I II i II I I I II II I I 1 II I I I I I : I : I I : I 

Db 637 PYEEAMSVALKTSDSKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 763 

I I : I I I : I I I I I I I : I I I I : I I I I I I I I I I I I I : I I I I : I II I : I I I I I I I I : 

Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 764 MIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

: : : : : I I : I M I I I I I I I I I I : : I III : I : I I : I I I I I I I I I : I I 

Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 883 

: I I I I I I III ::: I : I I I I I I I I I I M I I I I I : I : I II : I I I I I I I I : I I 

Db 815 IYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKS 870 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVS7VL 943 

I II I I I I I I I : I I I I I I II II I : I I : I I I : I : I I I I I I : I I II 

Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV--SDEFSKSRSSVSKVPLLLPNVSAL 92 8 



QY 



944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKT 1003 



: | | : : I I I I M I I I I I : I I I I I I I I I I I I = I = I I : I I I I I I I II I I I I I I II 
Db 929 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLS7VELNKTSWDLLYWRDIKKT 987 

Qy 1004 GWFGAS L FLLLS LT VFS I VS VTAYI ALALL S VT I S FRI YKGVI QAI QKS DEGHP FRAYL 1063 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I M I I I II 
Db 988 GWFGAS L FLLL S LTVFS I VS VT AY I ALALL SVT IS FRI YKGVI QAI QKS DEGHP FRAYL 1047 

Qy 1064 ESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNG 1123 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 1048 ESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNG 1107 

Qy 1124 LTLLI LALI SLFSVPVI YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 117 8 

I I I I M I I I I I I I : I I I I II I I I I I I I I I I II I I : I I I I I I I I I I I I I II I 1 I I I 
Db 1108 LTLLI LALI SLFS I PVI YERHQAQI DHYL G LANK SVKDAMAK I QAKI PGLKRKAE 1162 



RESULT 3 
Q8K3G8 

ID Q8K3G8 PRELIMINARY; PRT; 1163 AA. 

AC Q8K3G8; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-A. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=l(K)90; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Jin W., Long M. , Li R., Ju G. ; 

RT "Cloning and expression of the mouse Nogo-A protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY114152; AAM77068.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1163 AA; 126691 MW; 6B5F362799417EA4 CRC64; 

Query Match 72.2%; Score 4277.5; DB 11; Length 1163; 

Best Local Similarity 73.8%; . Pred. No. 6.2e-203; 

Matches 883; Conservative 108; Mismatches 154; Indels 51; Gaps 19; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111:111 I I I I I I I I I I I I I I I I I I I I I I I I I : I I : I I I !:: I I I I I II I I M I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VPTAPAAGAP LMD FGN D FVP PAP RGP L P AAP PVAP ERQ P S WD P S P VS S T VPA 119 

II I M I I I I I I I I I I : II : I I I I I I I I I I I II I I I I I I I I I I : I I : I I 
Db 61 AAGLSAVPVP--PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 



Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 



Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

II : I I I I I I I I I I I I : I : II II I I I I I I I I I I I I 1 I 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 22 3 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVS EASKEVSEKAKTLLIDRDLTEFSELEYSE 285 

III I I I I I II I I I I I I M I : : I : : I I I : I : I : I : : I : I I I I I I I I 

Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 2 83 

Qy 286 MGSSFSVSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEV 345 

Mill: III I I I :: I I : I I : I I : : I I : I : II : II II I I I I : I I I I I 
Db 284 MGSSFNGSPKGESAMLVENTKEEVIVRSKDKED-LVCSAALHNPQESPATLTKWKEDGV 342 

Qy 346 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 4 04 

: I I I I I I I ::: I II : I I I I I I I II I I : I I I I I : I I I : I I I : I : I I 

Db 343 MS PEKTMDI FNEMKMS WAPVREEYADFKP FEQAWEVKDT YEGS RDVLAA RANMES 398 

Qy 4 05 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAP FNPAATESIATN 464 

I I I I I I I I I I I I : I I I I I I I :: I I I I I I I : I I I I I I I I I : I I I I I I 
Db 399 KVDKKCFEDSLEQKSHGKDSESRNENASFPSTPELVKDGSRAYITCDSFT-SATESTAAN 457 

Qy 465 IFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

I I I : I I I I I I I II I I I I I I : I I I I : I I I II I I I I I I II III I II I M I I : I I 

Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 

Qy 525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 

II III I I I I I I I I I I I II I I I I II I I I I II I I I I I : I I I I I I I : I I I : I I I I I I I 

Db 517 TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 576 

Qy 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I I I I : I I I I I I I I I I I I I I II I I I : I I I I I I I I I : I I I I I I : I : II I I I I I I 
Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENIN7\ALQETEAPYISIACDLIKETKLSAEPAPDFSDY 703 

I I II I I I I : I I 1 i I I I I I I I : III II I I I I I I I I I I I I I II I I I I I : I I I : I 

Db 637 PYEEAMSVTMjKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFES 763 

II : I I I : I I I I I I I : I I I I : I I I I I I I I I I I I I : I I I I : t I I I : I I I I II I I : 

Db 697 SEIAKFEKSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ET 755 

Qy 7 64 MIEYENKEKLS7VLPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTA 823 

: : : : : I I : I I I I I I I I I I I I I : : I III : I : I I : I I I I I I I I I : I I 
Db 756 VTQHKHKERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTA 814 

Qy 824 VYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKS 8 83 

: Ml I I I III : : M M I I II I II I I I II I II I MM II M II I I M I M I 

Db 815 IYSNDDLLSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKS 870 

Qy 884 EIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSAL 943 

MM I I I I II M I I I II M III : I I M I I : I : I II II I M I I I 
Db 871 EIANVQSGANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL 928 

Qy 944 GHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKT 1003 

M I : M I I II II I II I M I I I I II I I II I M : MM I I II Ml I I I II II II 



Db 



929 -ESQIEMGNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKT 987 



Qy 1004 GW-FGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 

III I I I I I II II I I I I I I I I II I I I I I I I I I I I I 1 I I I I I I I I I I II I I I I I I I I I | | I 
Db 98 8 GWYFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 104 7 

Qy 1063 LESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I II M 
Db 104 8 LESEVAISEELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 

Qy 1123 GLTLLILALISLFSVPVI YERHQAQIDHYLGLAN 1178 

I I I I I II I I I I I I I : I I M I I I I I I I I I I I II I I I : I II I I I I I I I I II I II I I II 
Db 1108 GLTLLI LALI SLFSI PVI YERHQAQIDHYLGLANKSVKDAMAKIQAKI PGLKRKAE 1163 



RESULT 
Q8BGK7 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



PRELIMINARY; 



PRT; 1046 AA. 



OC 
OC 
OX 
RN 
RP 
RC 
RA 
RT 
RT 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
RN 
RP 
RC 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



(TrEMBLrel. 
(TrEMBLrel. 
(TrEMBLrel. 



23, 
23, 
25, 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; Mus. 



Q8BGK7 
Q8BGK7; 
01-MAR-2003 
01-MAR-2003 
01-OCT-2003 
RTN4. 
RTN4 . 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodentia; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N . A. 

STRAIN=129/SvcJ7, and 12 9SvcJ7; 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 

Structures of Human and Mouse Nogo/Rtn-4 . " ; 

Submitted (OCT-2002) to the EMBL/ GenBank/DDBJ databases. 

[2] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and 129SvcJ7; 
Oertle T., Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
[3] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7 ; 
Van der Putten H.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
[4] 

SEQUENCE FROM N.A. 

STRAIN=129SvcJ7; 

Van der Putten H., Mir A.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
EMBL; AY102280; AAM73502.1; -. 
EMBL; AY102286; AAM73507.1; -. 
MGD; MGI: 1915835; Rtn4 . 

GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
GO; GO:0007399; P : neurogenesis ; IDA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 



DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 1046 AA; 114221 MW; 8CE2E2238ED51222 CRC64; 



Query Match 61.8%; Score 3658.5; DB 11; Length 1046; 

Best Local Similarity 69.5%; Pred. No. 2e-172; 

Matches 767; Conservative 108; Mismatches 148; Inclels 81; Gaps 17; 

Qy 97 APPVAPERQ PSWDPS PVSSTVPAP SPLSAAAVSPSKLPED 136 

I I I : I I : I I I I I I I I I I I I 
Db 2 AP PLAG G GQ KG GAASEAWVPSLFVGVSGSTCTATVKSLVP I PAR SSRLSAA 51 

Qy 137 DEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLK 196 

: I : : I I I : I I : : : Mill; 
Db 52 RNETLFALPA ASEPVIPSSAE KIMDLK 78 

Qy 197 EQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVS 256 

I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I : : 
Db 79 EQPGNTVSSGQEDFPSVLFETAASLPSLSPLSTVSFKEHGYLGNLSAVASTEGTIEETLN 138 

Qy 257 EASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDE 316 

I I I : I : I : I : : I : III i I I I I I I I I I : II! I I I : : I I : I I : II : : I I : 

Db 139 EASRELPERATNPFVNRESAEFSVLEYSEMGSSFNGSPKGESAMLVENTKEEVIVRSKDK 198 

Qy 317 EEKLVSNNI LHXQQELPTALTKLVKEDEWS S EKAKDS FNEKRVAVEAPMREEYADFKPF 376 

I : I I : II III II I : I I I I I : I I I I I I I : : : I I I : I I I I I I II II 
Db 199 ED-LVCSAALHNPQESPATLTKWKEDGVMSPEKTMDIFNEMKMSWAPVREEYADFKPF 2 57 

Qy 377 ERVWEVKDSKEDS-DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPS 435 

I : I I I I I : I I I : I I I : I : II I I I I I I I I I I I I I I II II I : : I I I 

Db 258 EQAWEVKDTYEGSRDVLAA RANMESKVDKKCFEDSLEQKGHGKDSESRNENASFPR 313 

Qy 436 TPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKN 495 

III : I I I I I I I I I : : I I I I I I I I I : I I I I I I I I I I I I I I I : I I I I : I I I 
Db 314 TPELVKDGSRAYITCDSFS-SATESTAANIFPVLEDHTSENKTDEKKIEERKAQIITEK- 371 

Qy 496 TSTKTSNPFFVAAQDSETDYWTDNLTKVTEEWANMPEGLTPDLVQEACESELNEVTGT 555 

I I I I I I I I II III I I I I I I I I : I I I I III 1 I I I I I II I I I I I I I I I M I III 
Db 372 TSPKTSNPFLVAIHDSEADYVTTDNLSKVTEAWATMPEGLTPDLVQEACESELNEATGT 431 

Qy 556 KIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGA 615 

I I I I I II : I I I I I I I : I I I : I I I I I I I I I I I : II I I I I I I I I I I I I I I I I I : I I II 

Db 432 KIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEATPSPVLPDIVMEAPLNSLLPSTGA 4 91 

Qy 616 SVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMSVSLKVSGIKEEIKEPENINAALQE 674 

II I II : I I I I I I : I : II I I I I I I I I I I I I I I : I I I I I I I I I I I : III II 

Db 492 SVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSVALKTSDSKEEIKEPESFNAAAQE 551 

Qy 675 TEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 734 

I I I I I I I II I I I I I I I I I I I : I : I I : I I I : II I : I I I I I I I : I I I I : I II I I I I I 
Db 552 AEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFEKSVPDHCELVDDSSPESEPVDLFS 611 

Qy 735 DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDN 794 

I I I I I : I II I : I II I : I I I I I I I I : : : : : : I I : I I I I I MINIM: : I 
Db 612 DDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHKERLSASPQEVGKPYLESFQPNLHI 670 

Qy 795 TKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIID 854 

III : I : I I : I II I I II II M M I II II I III : : M M I II I II II I II I 



Db 



671 TKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDLLSSKEDKMKESETFSDSSPIEIID 729 





003 


EFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVE 
1 1 1 1 : 1 : 1 II : 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 M : 1 1 1 I M 1 1 Ml 
EFPTFVSAKDDS PKEYTDLEVSNKSEIANVQSGANSLPCSELPCDLSFKNTYPKDE 


914 


DO 


730 


785 


Qy 


915 


EKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKE 
: 11:11: | : | | | | | : | | || : , | : : | | | | | | | | | | | : | | | | | | || | 

AHV--SDEFSKSRSSVSKVPLLLPNVSAL-ESQIEMGNIVKPKVLTKEAEEKLPSDTEKE 


974 


Db 


786 


842 


Qy 


n t c 

9/5 


DRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALL 
Ml : 1 : M:l 1 1 1 II II 1 1 1 1 1 1 II 1 1 II II 1 1 1 1 II 1 1 1 1 I I I | | | | | | | | | | | | | 
DRSLTAVLSAELNKTSVVDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALL 


1034 


Db 


843 


902 


Qy 

Db 


1035 
903 


S VT I S FRI YKGVI QAI QKS DEGHP FRAYLE S EVAI S EELVQK YSNS ALGHWCT I KELRR 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I 
SVT IS FRI YKGVI QAI QKS DEGHPFRAYLES EVAI SEELVQKYSNSALGHVNST I KELRR 


1094 
962 


Qy 

Db 


1095 
963 


LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGL 
1 N 1 1 1 1 1 || I | | | || | | | | | | | | | | | | | | | | | | : | | | | | M | | | | | | | | | | 

LFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLIIALISLFSIPVIYERHQAQIDHYLGL 


1154 
1022 


Qy 


1155 


ANKNVKDAMAKIQAKIPGLKRKAE 117 8 
I 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 
ANKSVKDAMAKIQAKIPGLKRKAE 104 6 




Db 


1023 





RESULT 
Q7TNB7 
ID 
AC 
DT 
DT 
DT 
DE 
OS 
OC 
OC 



PRELIMINARY; 

(TrEMBLrel. 25, 
(TrEMBLrel. 25, 
(TrEMBLrel. 25, 



PRT; 



720 AA. 



Created) 

Last sequence update) 
Last annotation update) 



Craniata; Vertebrata; Euteleostomi ; 
Rodentia; Sciurognathi ; Muridae; Murinae; Mus , 



Q7TNB7 
Q7TNB7; 
01-OCT-2003 
01-OCT-2003 
01-OCT-2003 
Hypothetical protein. 
Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia ; Eutheria ; 
OX NCBI_TaxID=10090; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=C57BL/6; TISSUE=Brain; 
RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 
RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler 
RA Altschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N. 
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. 
RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 
RA Stapleton M. , Soares M.B., Bonaldo M.F. 
RA Brownstein M.J., Usdin T.B., Toshiyuki 
RA Raha S.S., Loquellano N.A., Peters G.J. 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A. , Gunaratne P.H., 
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J,, Hulyk S.W., 
RA Villalon D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A., 
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A., 
RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 



G. 

,K. 



D. 



Casavant T.L., Scheetz T.E., 
S., Carninci P., Prange C, 
, Abramson R.D., Mullahy S.J., 



RA Rodriguez A.C., Grimwood J . , Schmutz J., Myers R.M., Butterfield Y.S., 

RA Krzywinski M.I., Skalska U., Smailus D.E., Schnerch A., Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6; TTSSUE=Brain; 

RA Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC056373; AAH56373.1; -. 

KW Hypothetical protein. 

SQ SEQUENCE 720 AA; 77435 MW; 80AB7 8728F16EAB2 CRC64; 



Query Match 43.6%; Score 2580.5; DB 11; Length 720; 

Best Local Similarity 70.8%; Pred. No. 2.1e-119; 

Matches 527; Conservative 69; Mismatches 107; Indels 41; Gaps 13 

QY 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

'11:111 I I | | | | | | | | | | | | | M | | I M I I I : I I : I I I I :: I I II I I I I I M I II 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VP T APAAGAP LMD FGN D FVP PAP RG P L P AAP P VAP E RQP S WD P S P VS S T VP A 119 

I I I I I I I I I I I I I I I I : I I : I || I I | | | | | | | | | | M I I I I I I : II : I I 

Db 61 AAGLSAAPVP — PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

QY 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II M I I I M I I I I I I I I I I II II | | | | I I I I I I I I I M I I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLS 225 

II : M I I I I I I I I I I : I : I I I I M I | | | | | | | | M I 

Db 164 GSGSVDETLFALPAASEPVIPSSAEKIMDLKEQPGNTVSSGQEDFPSVLFETAASLPSLS 223 

Qy 226 PLSAASFKEHEYLGNLSTVLPTEGTLQENVSEAS KEVSEKAKTLLIDRDLTEFSELEYSE 285 

IN II I I I I I I I I : : I : : I I I : I : I : I : : | : ! I I I I i I I 

Db 224 PLSTVSFKEHGYLGNLSAVASTEGTIEETLNEASRELPERATNPFVNRESAEFSVLEYSE 283 

Qy 286 MGS S FSVS PKAESAVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEV 345 

Mill: I ::l I :||:||::||:|: || : II III |||:| III I 

Db 284 MGSSFNGSPKGESAMLVENTKEEVIVRSKDKED-LVCSAALHNPQESPATLTKWKEDGV 342 

Qy 346 VSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLES 404 

: I II I Ml : : : I I I : I I I I I I I I M I : I I I I I : I I hill : I : I I 

D b 343 MSPEKTMDIFNEMKMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANMES 398 

Qy 4 05 KVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATN 4 64 

I I I I II I I I M I I I :: I I I I I I : | | | | | | | | I : : | | || I I 

Db 399 KVDKKCFEDSLEQKGHGKDSESRNENASFPRTPELVKDGSRAYITCDSFS-SATESTAAN 457 

Qy 465 IFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKV 524 

Mhlll I II I I I I I I I I I : I M I : I I I II I I I I 1 I II I I I I I I I I I I I : I I 
Db 458 IFPVLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAIHDSEADYVTTDNLSKV 516 



Qy 



525 TEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCP 584 
H II M I I I I I I I I I | | | | | I I | I I M I I I r I I I I I I I :|||:|| Mill 



Db 



517 TEAWATMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCP 57 6 



QY 585 SFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPP 643 

I I M : I I I I I I I I I I I I | | | | I | | : | | MM I M : I I I I I | : | : \\ |||||| 
Db 577 SFEEAEATPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPP 636 

QY 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDY 703 

III -II I I I I I I I I I : Ml M IMMMM Ml M : I : I I : I 

Db 637 PYEEAMSVALKTSDSKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNY 696 

Qy 704 SEMAKVEQPVPDHSELVEDSSPDS 727 

M : I I I : MM I I I : I I I I : I 
Db 697 SEIAKFEKSVPDHCELVDDSSPES 720 



RESULT 6 
Q8K290 



ID Q8K290 PRELIMINARY; PRT; 639 AA. 

AC Q8K290; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

GN RTN4 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (JUN-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC032192; AAH32192.1; -. 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 639 AA; 70312 MW; 309A19DA37 603F11 CRC64; 



Query Match 42.7%; Score 2527; DB 11; Length 639; 

Best Local Similarity 79.5%; Pred. No. 7.7e-117; 

Matches 515; Conservative 57; Mismatches 66; Indels 10; Gaps 6; 

QY 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEA 591 

I I I I I I M M I I II M M II II M I II I M : I I I I I II : I II : M I I I M M M : I I 
Db 1 MPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESIYPTAQLCPSFEEAEA 60 

Qy 592 TPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMS 650 

I I I I I I I I I M M I I I I : I I I :|||| | | : | : | | I | I | II II I II I I 

Db 61 TPSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMS 12 0 

Qy 651 VSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

1 = 11 I I I I I I II I : III || I II I II II I I I I I I II II || : | : | | : | || : | I | 
D b 121 VALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPEFSNYSEIAKFE 180 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



711 QPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENK 770 

: I I I I I I I : I I I I : I I I I I I I I I I I I I : I I I . I : I II I : I I I I I I I I : : : : : : I 
181 KSVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHK 239 

7 71 EKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDL 830 

I = I I t I I I : : I Ml : I : I I : I II I I M I I : I I : I I I I I I 

24 0 ERLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDL 298 



831 



299 



891 



355 



951 



412 



1011 



472 



1071 



FISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPD 890 

I I I I : M I I I M II I I I I I I I I : I : I I I : I I M I I I I : I II I I I 

LSSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKSEIANVQS 354 

GAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEI 950 
I I I I I I : I I I I I I M III : I I : I I I : I : I M I I I : I I I I : I I : 
GANSLPCSELPCDLSFKNTYPKDEAHV — SDEFSKSRSSVSKVPLLLPNVSAL-ESQIEM 411 

ESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGAS 1010 

•I I I I I I I I I I I : I I I M I I I M I I : I : I I : I I I I I I M I I I I I M I I I I I M I I 
GNIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGAS 471 

LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 107 0 
I i I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I II I II I I I I I II I I I I I M I I I I I 
LFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAIS 531 



EELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILA 1130 
I I I I I I I I I M I I I I I I I M I I I II I I I I I I I I I II I I I I II I I I I II I I I I I I II I I I 
532 EELVQKYSNSALGHWSTIKELRRLFLVDDLVDSLKFAVI^WFTWGALFNGLTLLILA 591 

1131 LISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 117 8 
I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I M 
592 LISLFSIPVIYERHQAQIDHYLGIANKSVKDAMAKIQAKIPGLKRKAE 639 



AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RA 
RT 
RL 
DR 
DR 
DR 
DR 
DR 



Created) 

Last sequence update) 
Last annotation update) 



RESULT 7 
Q80W95 

ID Q8 0W95 PRELIMINARY; PRT; 578 AA. 

Q80W95; 

01-JUN-2003 (TrEMBLrel. 24, 
01-JUN-2003 (TrEMBLrel. 24, 
01-OCT-2003 (TrEMBLrel. 25, 
Nogo-A (Fragment) . 
NOGO-A. 

Mus musculus (Mouse) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus, 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N . A. 
Tozaki H., Hirata T.; 

"The partial sequence of mouse nogo-A cDNA clone#4 109 . " ; 
Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 
EMBL; AB073672; BAC75974.1; -. 

GO; GO: 0005783; C : endoplasmic reticulum; IEA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 



FT NONJTER 1 1 

SQ SEQUENCE 578 AA; 63696 MW; 832 67 0C17 1E4AC61 CRC64; 



Query Match 37.7%; Score 2233; DB 11; Length 578; 

Best Local Similarity 78.4%; Pred. No. 2.2e-102; 

Matches 460; Conservative 52; Mismatches 65; Indels 10; Gaps 6; 

QY 593 PSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENPPPYEEAMSV 651 

I I I I I I I I I I I I I I I I : II I I I I I I I : I II I I I : I : I I I I M I I I I I I I I I I 
Db 1 PSPVLPDIVMEAPLNSLLPSTGASVAQPSASPLEVPSPVSYDGIKLEPENPPPYEEAMSV 60 

QY 652 SLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQ 711 

: I I I I I I M I I I : II I II I I II I I I I I I I I I I I I I I I I : | | | : ! | | : | | | : 
Db 61 ALKTSDAKEEIKEPESFNAAAQEAEAPYISIACDLIKETKLSTEPSPGFSNYSEIAKFEK 120 

QY 712 PVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKE 771 

I I I I M I : I I I I : I I I I I I I I II I II : I I I I : I I I I : I II M I I I : : : : : : | | 
Db 121 SVPDHCELVDDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-ETVTQHKHKE 179 

Qy 772 KLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLF 831 

: I I I I I I I I I I I I I : : I Ml : I : I I : I I I I I I I I I : II : I I II I I 

Db 18 0 RLSASPQEVGKPYLESFQPNLHITKDA-ASNEIPTLTKKETISLQMEEFNTAIYSNDDLL 2 38 

QY 832 ISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDG 8 91 

I I I ::: I : I II I I I I I I I I I I I I M : I : I II : I I I I M I I : I I I I I I I 

Db 239 SSKEDKMKESETFSDSSPIEIIDEFPTFVSAKDDS PKEYTDLEVSNKSEIANVQSG 294 

QY 8 92 AGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIE 951 

11111:11111111 I I I : I I : I I I : I : I I I I I : I M I : I | : 
Db 295 ANSLPCSELPCDLSFKNTYPKDEAHV--SDEFSKSRSSVFKVPLLLPNVSAL-ESQIEMG 351 

QY 952 SIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSVVDLLYWRDIKKTGWFGASL 1011 

: I I I I I II I I I I : I I I I II I I I I I I : | : | | : | I I I I II I I I I I I I I I I I I I I II I I 
Db 352 NIVKPKVLTKEAEEKLPSDTEKEDRSLTAVLSAELNKTSWDLLYWRDIKKTGWFGASL 411 

Qy 1012 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 1071 

I M I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I | | | | M | M I I 
Db 412 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 471 

Qy 1072 ELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILAL 1131 

M I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I II I I 

Db 472 ELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILAL 531 

Qy 1132 ISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

I I I M : I I I I I I I I I I I I I II I II IE : I I I I I I I I I I I | | | | | | | | | 
Db 532 I SLFS I PVI YERHQAQI DHYLGKANKS VKDAMAKIQAKI PGLKRKAE 578 

RESULT 8 
Q96B16 



ID Q96B16 PRELIMINARY; PRT; 392 AA. 

AC Q96B16; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein (RTN4) . 

GN RTN4 . 



OS 


Homo sapiens (Human) . 




oc 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 


oc 


i idiLULLci x j_ d f n, u uii c jl x d , riiiiiaLcs / La tdrrniiii ; nomxnxaae ; Homo 


* 


ox 


NCBI TsvTn=9606* 




RN 


[11 _ 




RP 


SEQUENCE FROM N.A. 




RC 


TISSUE=Kidney; 




RA 


Strausberg R. ; 




RL 


ouumictea \uui zuui; to tne imvidW benbank/ DDB J databases. 




RN 


L ^ J 




RP 


SROUKMPF FROM M A 




RA 


Oertle T van der Pnffpn N ^nhiAT^K m tp 




RT 


<jeiiunu_c oLiucLure ana runctionai Lnaracterization of the 


Promoter 


RT 


oLiuLLuicb ul nuuian ana inouse JNogo/ Ktn - 4 . ; 




RL 


ouumiLtea iuli zuuz j to tne rJVlrJL/ benBanK/ DDB J databases. 




RN 


r 3 1 

L *-* J 




RP 


^FnTTFNTF from m a 




RA 






RL 


o Lijjxtu. l, tea ^i v irt.i zuuz; to tne &mdl/ benoanK/ DDBJ databases. 




RN 


r 4 "j 

L ^ J 




RP 


SEOUENCE FROM N A 




RA 


Van der Pnffpn H 




RL 


ouwmiLLeu -£.uuz; to tne rjJYiDij/ CjenrsanK/ DDB J databases. 




RN 


r si 




RP 


^FOTTFMPF FDHM T\T A 




RX 






RA 


Oertle T van df^r Pnt-t-pn H ^r^ViT^aVN M P 




RT 


^jciiuilixu ouiuoture ana runctionai bnaractenzation or tne 


Promoter 


RT 


oLiuLLuitb 01 numan ana riouse JNOQo/Ktn~4. ; 




.RL 






DR 


EMBL; BC0'1616S' AAH1 61 fiS 1* - 




DR 


EMBL; AY102285; AAM64242.1; 




DR 


EMBL; AY102278; AAM64247.1; 




DR 


GO; GO: 0005783; C : endoplasmic reticulum; IEA. 




DR 


InterPro; IPR003388; Reticulon. 




DR 


Pfam; PF02453; Reticulon; 1. 




DR 


PROSITE; PS50845; RETICULON; 1. 




KW 


Hypothetical protein. 




SQ 


SEQUENCE 392 AA; 42274 MW; D7B2AA5E839E58AD CRC64; 





Query Match 25.8%; Score 1531; DB 4; Length 392; 

Best Local Similarity 32.4%; Pred. No. 6.1e-68; 

Matches 382; Conservative 7; Mismatches 3; Indels 786; Gaps 4 
1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 




Db 



1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



Qy 



61 AG L S AAP VP T APAAGAP LMD FGN D FVP P AP RG P L PAAP P VAP E RQ P S W D P S P VS S T VP AP 120 



Db 




61 AGL S AAP VP T APAAGAP LMD FGND FVP PAP RG P L PAAP P VAP E RQ P S WD P S P VS S T VP AP 12 0 



Qy 



121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



Db 




121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 



Qy 



181 SSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGN 240 



Db 


181 


Qy 


241 


Db 


186 


Qy 


301 


Db 


186 


Qy 


361 


Db 


186 


Qy 


421 


Db 


186 


Qy 


481 


Db 


191 


Qy 


541 


Db 


191 


Qy 


601 


Db 


191 


Qy 


661 


Db 


196 


Qy 


721 


Db 


196 


Qy 


781 


Db 


200 


Qy 


841 


Db 


200 


Qy 


901 


Db 


200 


Qy 


961 : 


Db 


200 


Qy 


1021 ; 



111:1 

SSGSV 185 



185 



1Q5 

361 AVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNH 420 

DSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDE 480 
1:1 I 

186 DETLF 190 

481 KKIEEKJ^QIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPDL 54 0 

541 VQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPS FEES EAT PSPVLPDI 600 

190 



-ALPAA- 



195 



195 



-SEPV 199 



199 



199 

901 PHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLE 960 

199 



: : II I I I I I I I I I I I I I I I | | I | | | | | M I I I 
- 1 RS S AWDLL YWRDI KKTGWFGAS LFLLLS LTVF 234 



I I I N I I I I I I I I I I I I I I | | M I I I I I I I I I I I I I I I I I | | | | | | | M II I I II I I I I 



Db 


235 


SIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNS 


294 


Qy 


1081 


ALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 
If 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 II M 1 ! 1 1 1 1 1 1 I M 1 I 1 1 1 1 1 1 1 I 1 1 1 ! 1 1 1 1 1 i i i 1 

1 ' 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 I 1 1 1 M 1 M l 1 1 1 1 M 1 1 II 1 1 1 1 II 1 1 1 1 | | | ! | | | | | | | | 

ALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVI 


1140 


Db 


295 


354 


Qy 


1141 


YERHQAQI DH YLGLANKNVKDAMAKI QAKI P GLKRKAE 117 8 

1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

YERHQAQI DHYLGIANKNVKDAMAKIQAKI PGLKRKAE 392 




Db 


355 





RESULT 
Q8BHF5 
ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 



OC 

OC 

OX 

RN 

RP 

RC 

RA 

RT 

RT 

RL 

RN 

RP 

RC 

RA 

RL 

RN 

RP 

RC 

RA 

RL 

RN 

RP 

RC 

RA 

RL 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

SQ 



Created) 

Last sequence update) 
Last annotation update) 



Rodentia; 



Craniata; Vertebrata; Euteleostomi ; 
Sciurognathi; Muridae; Murinae; Mus , 



Q8BHF5 PRELIMINARY; PRT; 375 AA. 

Q8BHF5; 

01-MAR-2003 (TrEMBLrel. 23, 
01-MAR-2003 (TrEMBLrel. 23, 
01-OCT-2003 (TrEMBLrel. 25, 
RTN4 . 
RTN4 . 

Mus musculus (Mouse) . 
Eukaryota; Metazoa; Chordata; 
Mammalia ; Eutheria ; 
NCBI_TaxID=10090; 
[1] 

SEQUENCE FROM N.A. 

STRAIN=129/SvcJ7, and 129SvcJ7; 

Oertle T., van der Putten H., Schwab M.E.; 

"Genomic Structure and Functional Characterization of the Promoter 

Structures of Human and Mouse Nogo/Rtn-4 . " ; 

Submitted (OCT-2002) to the EMBL/GenBank/DDB J databases. 

[2] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7, and . 129SvcJ7; 
Oertle T., Schwab M.E.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 
[3] 

SEQUENCE FROM N.A. 
STRAIN=129/SvcJ7; 
Van der Putten H. ; 
Submitted (MAY-2002 
[4] 

SEQUENCE FROM N.A. 
STRAIN=129SvcJ7; 
Van der Putten H., Mir A.; 

Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases 
EMBL; AY102282; AAM73504.1; -. 
EMBL; AY102286; AAM73509.1; -. 
MGD; MGI: 1915835; Rtn4 . 

GO; GO: 0005783; C : endoplasmic reticulum; IDA. 
GO; GO: 0007399; P : neurogenesis ; IDA. 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

SEQUENCE 375 AA; 40300 MW; 23D9EB19BE67 1AE6 CRC64; 



to the EMBL/ GenBank/DDBJ databases. 



Query Match 



20.3%; Score 1201.5; DB 11; Length 375; 



Best Local Similarity 28.6%; Pred. No. l.le-51; 

Matches 337; Conservative 17; Mismatches 20; Indels 805; Gaps 11 

MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 5 9 

111 = 111 Mill MINI I IIIIM:||:||||::|||||IMIIIIM 

MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

AAGL S AAP VPTAP AAGAP LMD FGND FVP PAP RGP L P AAP P VAP E RQ P S WD P S P VS S T VP A 119 
MINIUM Ml llhll :| I I I I II I I I I I I I I MINIM: M : I I 
AAGLSAAPVP PAA-APLLDFS S DSVPPAPRGPLPAAPPTAPERQP SWERS PAAS A 114 

PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 
M N II II II II II II II I Nil || | | | | M II I II I I I II 



I I 

GS 165 



Qy 


l 


Db 


l 


Qy 


60 


Db 


61 


Qy 


120 


Db 


115 


Qy 


180 


Db 


164 


Qy 


240 


Db 


166 


Qy 


300 


Db 


166 


Qy 


360 


Db 


166 


Qy 


420 


Db 


166 


Qy 


480 


Db 


174 


Qy 


540 


Db 


174 


Qy 


600 


Db 


174 


Ov 


660 


Db 


179 


Qy 


720 ■ 


Db 


179 


Qy 


780 < 



165 



165 



165 



I I : I I 

-GSVDETLF 173 



173 

540 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 

173 



I : I : I 

-ALPAA ^7 3 



660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

178 

»SSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 
I II I 

SEPV 182 



Db 



183 



182 



Q^ 840 ETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 
Db 183 182 

QY 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 
Db 183 1Q2 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTV 1019 

: I I : : I M I II I I I I I M I I I I I I I I I I I I I I I I 

Db 183 IPS S AWDLL YWRD I KKTGWFGAS LFLLLS LTV 216 

Qy 1020 FSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 1079 

I I I I I I M I M II I I I II I I I M I I || I I I | | | | | | | | | | | | | | || || | | | || | | | | | | | 
Db 217 FSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 276 

Qy 1030 SALGHWCTIKELRRLFLVDDLVDSLKFAVIM^FTYVGALFNGLTLLILALISLFSVPV 1139 

I I I I I I I I M II I I M I I I II II I I I I I I I I I | | | | | | | | | | | | | || | M | | | | | | : || 
Db 277 SALGHWSTIKELRRLFLVT)DLVDSLKFAVL 336 

QY H4 0 I YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 117 8 

I I I I M I I I I I I I I I I I I : I II I I II I I II I I I I I II I I 
Db 337 I YERHQAQI DHYLGLANKSVKDAMAKI QAKI PGLKRKAE 375 



RESULT 10 
Q8BH78 

ID Q8BH78 PRELIMINARY; PRT; 356 AA. 

AC Q8BH7 8; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update). 

DE RTN4 . 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostorni ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T . , van der Putten H., Schwab M.E.; 

RT "Genomic Structure and Functional Characterization of the Promoter 

RT Structures of Human and Mouse Nogo/Rtn-4 . " ; 

RL Submitted (OCT-2002) to the EMBL/ GenBank/ DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7, and 129SvcJ7; 

RA Oertle T., Schwab M.E.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129/SvcJ7; 

RA Van der Putten H.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/ DDB J databases. 



RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=129SvcJ7; 

RA Van der Putten H., Mir A.; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY102281; AAM73503.1; 

DR EMBL; AY102286; AAM73508.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 356 AA; 38403 MW; 4366C03BA9630B56 CRC64; 

Query Match 19.6%; Score 1163; DB 11; Length 356; 

Best Local Similarity 21.1%; Pred. No. 8e-50; 

Matches 326; Conservative 12; Mismatches 17; Indels 824; Gaps 8 

QY 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

111 = 111 Mill MINI I I I ! I I I I I I I I II : M : I | I | :: I | M | II I I I I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPA 119 

I I I I I I I I I I I I I I M : I I : I I I I I I I | | | | M I I I I II I I I I : I I : I I 

Db 61 AAGLSAAPVP— PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

QY 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRR 179 

II I I I I I I I I I I M I I I M I I I I M I I || I || || | | | | | | | 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

Qy 180 GS SGAWXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPS LS PLSAAS FKEHEYLG 239 

I I 

Db 164 GS 165 

Qy 240 NLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESA 299 
Db 166 165 

Qy 300 VIVANPREEI IVKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVVSSEKAKDSFNEKR 359 
Db i 66 

Qy 360 VAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTN 419 
Db 166 165 

QY 4 20 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 
Db 166 165 

Qy 48 0 EKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEVVANMPEGLTPD 539 
Db 166 __ 165 



Qy 



540 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 



Db 



166 165 



Qy .600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

Db 166 165 

Qy 660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

Db 166 165 

Qy 720 VEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 779 

Db 166 165 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

Db 166 165 

Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTE 899 

I I : 

Db 166 GSV 168 

Qy 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 

Db 169 168 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTV 1019 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 169 WDLLYWRDI KKTGWFGAS LFLLLSLTV 197 

Qy 1020 FSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSN 1079 

I I M I I I I I I I M M I I I I I I I! I I I I I I II I I I I I I I I I I I I M I I II I I I I I I I I I I I 
Db 198 FS I VS VTAYI ALALLS VT I S FRI YKGVI QAI QKS DEGH P FRAYLES EVAI S EELVQKYSN 257 

Qy 1080 SALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPV 1139 

I I I I I I I I I I I II I II I II I I I I I I I I I M I II I I I I I I I I I I I I I I I I II I I I I I : M 
Db 258 SALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIPV 317 

Qy 1140 I YERHQAQI DHYLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

I t I I I I I I I I I I II I II I : I I I M I I I I I I I I I I I I I I I 
Db 318 I YERHQAQIDHYLGLANKSVKDAMAKI QAKI PGLKRKAE 356 



RESULT 11 
Q8K3G7 

ID Q8K3G7 PRELIMINARY; PRT ; 357 AA. 

AC Q8K3G7; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Nogo-B. 

GN RTN4 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 



RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/ c ; 

RA Jin W., Li R., Long M. , Shen J., Ju G.; 

RT "Cloning and expression of the mouse Nogo-B protein."; 

RL Submitted (MAY-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AY114153; AAM77069.1; 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C : endoplasmic reticulum; IDA. 

DR GO; GO: 0007399; P : neurogenesis ; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 357 AA; 38566 MW; 73BB3D17DFDBDF15 CRC64 ; 

Query Match 19.5%; Score 1152.5; DB 11; Length 357; 

Best Local Similarity 27.6%; Pred. No. 2.6e-49; 

Matches 326; Conservative 12; Mismatches 17; Indels 825; Gaps 9; 

Qy 1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKP 59 

llhlll Mill I I II I I I I I I I I I I I I I I I I : I I : I I I I :: I I I I I I I I M I I I I 
Db 1 MEDIDQSSLVSSSADSPPRPPPAFKYQFVTEPEDEEDEEDEEEEEDDEDLEELEVLERKP 60 

Qy 60 AAGL S AAP VPTAP AAGAP LMD FGND EVP PAP RG P L P AAP P VAP ERQ P S WDP S P VS S T VPA 119 

I I I I I I I M I III 111:11 : I I I I I I I I I I I I I I I I I I I I | M : I I : | I 
Db 61 AAGL S AAP VP — PAA-APLLDFSSDSVPPAPRGPLPAAPPTAPERQPSWERSPAAS A 114 

Qy 120 PSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRR 179 

M II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I 

Db 115 PSLPPAAAVLPSKLPEDDEPPAR — PPAPAGASPLAE PAAPPSTPAAPKRR 163 

QY 180 GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLG 239 



Db 164 GS 165 

Qy 240 NLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSVSPKAESA 299 

Db 166 165 

Qy 300 VIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKR 359 

Db 166 165 

Qy 360 VAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTN 419 

Db 166 165 

Qy 420 HEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTD 479 

Db 166 165 

Qy 4 80 EKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANMPEGLTPD 539 

Db 166 165 

Qy 54 0 LVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLPD 599 



Db 



166 



165 



Qy 600 IVMEAPLNSAVPSAGASVIQPSSSPLEASSWYESIKHEPENPPPYEEAMSVSLKVSGIK 659 

Db 166 ■ 165 

Qy 660 EEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSEL 719 

Db 166 165 

Qy 720 VEDSS PDSEPVDLFSDDS I PDVPQKQDETVMLVKESLTETS FESMIEYENKEKLSALPPE 779 

Db 166 165 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKI PLQMEELSTAVYSNDDLFI SKEAQI R 839 

Db 166 165 

Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK3EIANAPDGAGSLPCTE 899 

I 1 : 

Db 166 GSV 168 

Qy 900 LPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVL 959 

Db 169 168 

Qy 960 EKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGW-FGASLFLLLSLT 1018 

I I II I II I I I I I I I I ! I I I I I I I I I I I I 
Db 169 WDLLYWRDIKKTGWYFGASLFLLLSLT 197 

Qy 1019 VFS IVSVTAYI ALALLSVT I S FRI YKGVI QAI QKS DEGHP FRAYLES EVAI S EELVQKYS 1078 

M I 1 I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I I II I I I I I I 
Db 198 VFS I VS W AYI ALALL S VT IS FRI YKGVI QAI QKS DEGHP FRAYLES EVAI S EELVQKYS 257 

Qy 1079 NSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWFT 1138 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I : I 
Db 258 NSALGHWSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSIP 317 

Qy 1139 VI YERHQAQI DH YLGLANKNVKDAMAKI QAKI PGLKRKAE 1178 

I I I I I I I I I II I I I I I I I I : I I I I I I I I I I I M I I I II I I 
Db 318 VI YERHQAQI DHYLGLANKSVKDAMAKIQAKI PGLKRKAE 357 



RESULT 12 
Q7YRW9 

ID Q7YRW9 PRELIMINARY; PRT; 184 AA. 

AC Q7YRW9; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE RTN4w (Fragment) . 

GN RTN4 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 



RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=1283228 8; 

RA Oertle T., Klinger M., Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164744; AAP47319.1; -. 

FT NONJTER 1 1 

SQ SEQUENCE 184 AA; 20671 MW; DE990E03BBAF84A1 CRC64; 

Query Match 15.2%; Score 901; DB 6; Length 184; 

Best Local Similarity 98.9%; Pred. No. 2.8e-37; 

Matches 182; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



QY 995 LYWRDIKKTGWFGASLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSD 1054 

N I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I M I I I I I I I I I I I I | | | | | | 
LYWRDIKKTGWFGXXLFLLLSLTVFSIVSWAYIALALLSVTISFRIYKGVIQAIQKSD 60 



Db 1 



Qy 1055 EGHPFRAYLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMV7VF 1114 

N I I I I I I I I II MINIMI I I I I I I M I I I I I I I I I I I I M I I I 

Db 61 EGHPFRAYLESEVAISEELVQKYSNSALGHWCTIKELRRLFLV1)DLVDSLKFAVXMWVF 120 

Qy 1115 TWGALFNGLTLLILALISLFSVPVI YERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 1174 

I I I I I II I I I I II I I M M I I I I I II I I I I I I I I M I I I I II I I I I I I I II I I I I I I I I I 
Db 121 TWGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLK 180 

Qy 1175 RKAE 1178 

I I II 

Db 181 RKAE 184 



RESULT 
Q7T224 
ID 
AC 



13 



Created) 

Last sequence update) 
Last annotation update) 



Q7T224 PRELIMINARY; PRT; 199 AA. 

Q7T224; 

DT 01-OCT-2003 (TrEMBLrel. 25, 

DT 01-OCT-2003 (TrEMBLrel. 25, 

DT 01-OCT-2003 (TrEMBLrel. 25, 

DE RTN4-C. 

GN RTN4 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus . 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=12832288 ; 

RA Oertle T., Klinger M. , Stuermer C.A. , Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17:1238-1247(2003). 

DR EMBL; AY164737; AAP47312.1; -. 

SQ SEQUENCE 199 AA; 22293 MW; 07CF4E4EF2723251 CRC64; 



Query Match 15.0%; Score 887; DB 13; Length 199; 

Best Local Similarity 88.2%; Pred. No. 1.6e-36; 



Matches 180; Conservative 



7; Mismatches 11; Indels 



6; Gaps 



l; 



Qy 975 DRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALL 1034 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I 

D b 2 DSQPSG WKDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALL 55 

Qy 1035 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRR 1094 

I I I I M I I I I M I I I I I I I I I I I I I I I I I II : I I : I I : I : I I I I : I I I : I I : II I I I 
Db 56 SVTISFRIYKGVIQAIQKSDEGHPFRAYLESDVAVSEDLIQKYSSWLGHINGTVKELRR 115 

Qy 1095 LFLVDDLVDS LKFAVLMWVFT YVGALFNGLT LLI LAL I S L FS VPVI YERHQAQI DH YLGL 1154 

I I I I I I M I I II I I I I II I I I I | | I | || || | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 116 LFLVT)DLVDSLKFAVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGL 175 

Qy 1155 ANKNVKDAMAKIQAKIPGLKRKAE 1178 

M I I I I I I I I I I I I II I I I I I I 
Db 176 WKNVKDAMAKIQAKIPGLKRKTE 199 



RESULT 14 
Q9GM33 

ID Q9GM33 PRELIMINARY; PRT; 179 AA. 

AC Q9GM33; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Osada N., Hida M. , Kusuda J., Tanuma R. , Iseki K. , Hirai M., Terao K., 

RA Suzuki Y., Sugano S., Hashimoto K. ; 

RT "Isolation of full-length cDNA clones from macaque brain cDNA 

RT libraries."; 

RL Submitted (OCT-2000) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AB049853; BAB16739.1; 

DR GO; GO: 0005783; C : endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 179 AA; 19949 MW; 5F8CD4383FEE9E02 CRC64; 

Query Match 14.6%; Score 867; DB 6; Length 179; 

Best Local Similarity 98.3%; Pred. No. 1.3e-35; 

Matches 176; Conservative 3; Mismatches 0; Indels 0; Gaps 0; 

Qy 1000 IKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPF 1059 

: M I I I I I I II I I I I I I I I I I I 1 I I I | || | | | || | | | | | | | | | | : | | | | | | | | | | | | | | | 
1 MKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYRGVIQAIQKSDEGHPF 60 



QY 



1060 RAYLESEVAI S EELVQKYSNSALGHWCT I KELRRLFLVDDLVDS LKFAVLMWVFT YVGA 1119 



M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I : I I M I I I I I I I I I 
Db 61 RAYLESEVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLRFAVLMWVFTYVGA 120 

Qy 1120 LFNGLTLLII^ISLFSVPVIYERHQAQIDHYLGIANKNVKDAMAKIQTVKIPGLKRKAE 1178 

I I M I I I I 1 I I I II I I I I I I II II I I I I I I I I I I I II I I I I I I I I 1 ! I I I I I II I I | | I 
Db 121 LFNGLTLLII^ISLFSVPVIYERHQAQIDHYLGI^KNVKDAMAKIQAKIPGLKRKAE 17 9 



RESULT 15 
Q90638 

ID Q90638 PRELIMINARY; PRT; 760 AA. 

AC Q90638; 

DT 01-NOV-1996 (TrEMBLrel. 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE ChS-Rex-b (RTNl-A) . 

GN RTN1. 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_Tax I D= 9 0 3 1 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=96386034; PubMed=87938 64 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G., Adu J., Davies A.M., 

RA Georgiev G.P., Buchman V.L.; 

RT "Intracellular compartmentalization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=97183663; 

RA Ninkina N.N., Baka I.D., Buchman V.L.; 

RT "Rat and chicken s-rex/NSP mRNA: nucleotide sequence of main 

RT transcripts and expression of splice variants in rat tissues."; 

RL Gene 184:205-210(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22715887; PubMed=1283228 8 ; 

RA Oertle T . , Klinger M. , Stuermer C.A., Schwab M.E.; 

RT "A reticular rhapsody: phylogenic evolution and nomenclature of the 

RT RTN/Nogo gene family."; 

RL FASEB J. 17: 1238-1247 (20Q3).. 

DR EMBL; U17606; AAC60075.1; -. 

DR EMBL; AY164724; AAP47301.1; 

DR GO; GO: 0005783; C: endoplasmic reticulum; IEA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

SQ SEQUENCE 760 AA; 82502 MW; 4 65C1B429F799D5C CRC64; 

Query Match 13.3%; Score 788.5; DB 13; Length 760; 

Best Local Similarity 27.7%; Pred. No. 7e-31; 

Matches 296; Conservative 115; Mismatches 315; Indels 341; Gaps 38; 



143 PPPPPPASVSPQAEPWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNT 202 
III : : I I I I I | IN . | 

4 PPDPQDLLLAGTAER-W AAAGADEYAAGAA1RDGDGA QQREQL 45 

203 ISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEV 262 
: I I •• I I M I I : : I | : | : : : | 

46 AFGS AREH P P VAMATA S PGVTAS S RLFD Y GSS SAN GADS SFYT 88 

263 SEKAKTLLID RDLTEFSELEYSEMGSSFSVSPKAESAVIVANPREEIIVKNKDE 316 

I 1:1 I I I I : : I : | | | : : | : : : | 
89 s LISDVHYTTPRDNTYFTGV-YQQENSPIPCSGSTEGFNALGHPVQDV TGFE 139 

317 EEKLVSNNILHXQQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPF 376 

1 I I I : : I I I : : | | : | : 

140 SRGLFS LDSGIEMTPAESA EVDKSLTDPMKVEGYKYMDI 178 

377 ERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDD — TSFP 434 

11:1 I I : I I I : I : 

179 SRPEEIK YQEKHDPDSEDESPDLIDEYR 206 

435 STPEGIKDRSGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEK 4 94 

IM II I 1:1:1 1:11 |: :: | :|| 

207 GTPIG SGH AAEPQRTTASEAIKA PKEQDPLE DKS FRDQHNAS WTAP 253 

495 NTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWANM PE GLTP 538 

III II : I : I I I : II | 

254 VKITLTETPGAREATSKEASVTQPKSGLKPSHEWPTVMVSEPEDDSPGSVTPPSSGTEP 313 

539 DLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP 598 

: : I : I M : I I I I I II I I : | | | 

314 SGSESQGKGSLSEDELISAIKEAKGFSFETSEVQQS PAV SAEKQEQKMKPGRP 366 

599 DIVMEAPLNSAVPSA GASVIQ-PSSS PLEAS SV NYESIKHEPENPPPYEEAMSVS 652 

: : I I : : I I Ml: I III I I I : | | | | | | 

367 — AVPSPLDNEASSAESGDSEIELVSEDPLAAEEVLHSNYMTFSH-IGGPPP — SPASPS 421 

653 LKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQP 712 

:: I ::M : | :: | ;|| :: | : | , ,, 

422 IQYSILREERE AELDSELIIESCD ASSASEESP KREQD 459 

713 VPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEK 772 

I =1 I ::|| || : 

460 SPLMKPMVMD IIKE ENSSR 478 

773 LSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFI 832 
I I I 

479 ASASDYE 485 

833 SKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA 892 

486 h: " 111 

q Hb ASKTTE-SRMNRE NLADSA 503 

893 GSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIES 952 

I : : I I I : : 

504 SYLKSS FVAPKVSSE PPTSAVSTEELKERI 533 



Qy 953 IVKPKVLEKEAEKKLPSDTEKEDRSPSAI-FSADLGKTSWDLLYWRDIKKTGWFGASL 1011 

I : I : I ' : I = Mil: || : : | | | M I I I : I I : I I I : I 

Db 534 ILKKPIEETWNQSKVSSKDSGKRSPLALPLLPFLNKQKAINLLYWRDIKQTGIVFGSLL 593 

Qy 1012 FLLLSLTVFSIVSVTAYIALALLSVTI S FRI YKGVI QAIQKS DEGHPFRAYLES EVAI S E 1071 

II Ml I I : I M M:IM II I I I I I I I I I : I I : I I : I I I I I I : I I I : I : : I : 
Db 594 LLLFSLTQFSWSWAYLALAGLSATISFRIYKSVLQAVQKTDEGHPFKAYLDMEMNLSQ 653 

Qy 1072 ELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLLILAL 1131 

: : I I I : : : I I I : I M I I I I I I I II I I I I I I I I I I : I I II I I I II I I I I I : I : 
Db 654 DQIQKYTDCLQLYVNSTVKELRRLFLVQDLVDSLKFAVLMWLLTYVG7VLFNGLTLLIMAV 713 

Qy 1132 ISLFSVPVIYERHQAQIDHYLGIANKNVKDAMAKIQAKIPGLKRKAE 1178 

: : I : : I I : I : : : I I I I I I I : : : I I I I I I I I I I I I I I 
Db 714 VSMFTLPWYDKYQAQIDQYLGLVRTHINTWAKIQAKIPGAKRKAE 760 



Search completed: September 29, 2004, 18:19:47 
Job time : 99.6748 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run September 29, 2004, 18:04:48 ; Search time 16.8874 Seconds 

(without alignments) 
3632.211 Million cell updates/sec 

Title: US-09-830-972-29 
Perfect score: 5923 

Sequence: 1 MEDLDQSPLVSSSDSPPRPQ VKDAMAKIQAKIPGLKRKAE 117 8 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 141681 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_42 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


5815 


98. 


2 


1192 


1 


RTN4_HUMAN 


Q9nqc3 


homo sapien 


2 


4296.5 


72. 


5 


1163 


1 


RTN4 RAT 


Q9jkll 


rattus norv 


3 


915 


15. 


4 


199 


1 


RTN4_MOUSE 


Q99p72 


mus musculu 


4 


791 


13. 


4 


777 


1 


RTN1_RAT 


Q64548 


rattus norv 


5 


787.5 


13. 


3 


776 


1 


RTN1_HUMAN 


Q16799 


homo sapien 


6 


626 


10. 


6 


237 


1 


RTN3_MOUSE 


Q9es97 


mus musculu 


7 


625.5 


10. 


6 


236 


1 


RTN3_HUMAN 


095197 


homo sapien 


8 


514 


8. 


7 


545 


1 


RTN2_HUMAN 


075298 


homo sapien 


9 


500 


8. 


4 


471 


1 


RTN2 MOUSE 


070622 


mus musculu 


10 


324.5 


5. 


5 


865 


1 


CPN_DROME 


Q02910 


drosophila 


11 


320 


5. 


4 


5147 


1 


PCLO_HUMAN 


Q9y6v0 


homo sapien 


12 


315 


5. 


3 


5038 


1 


PCLO_MOUSE 


Q9qyx7 


mus musculu 


13 


299 


5. 


0 


3924 


1 


ANK2_HUMAN 


Q01484 


homo sapien 


14 


296.5 


5. 


0 


2468 


1 


MAP B_HUMAN 


P46821 


homo sapien 


15 


289 


4. 


9 


2459 


1 


MAP B_ RAT 


P15205 


rattus norv 


16 


289 


4. 


9 


2464 


1 


MAPB_MOUSE 


P14873 


mus musculu 


17 


287 


4. 


8 


5120 


1 


PCLO_CHICK 


Q9pu36 


gallus gall 



18 


7 ft A S 


fi 


• o 


1 64 A 
o Oft ft 


1 


MTlTrn MAT T C C 1 

JYllINU jyiUUorj 


Q62504 


mus musculu 


1 Q 


Z / O 


A 
4 


n 
. / 


jUoj 


1 


irOliU KAl 


Q9 j ks6 


rattus norv 


9 D 


9 7^ ^ 
z i o . D 


A 

4 ■ 


. t> 


O O O 4 


1 


MINI HUMAN 


Q96t58 


homo sapien 


21 


^ D ft * O 


A 

fl - 


. D 


ft R 4 R 
0 O ft D 


1 


AJMUl L-Arjrilt 


Q9n4m4 


caenorhabdi 


22 


96"3 ^ 


A 

ft . 


A 
. ft 




1 




P13611 


homo sapien 




t-Di , j 


A 

*± < 


A 
. ft 


66^9 


1 


TTMQ Q P7\PTTT 


001761 


caenorhabdi 


9 4 
ft 


96n r 


A 

4 . 


. fi 


"5 "5 ft 1 


1 


BOV1N 


P81282 


bos taurus 


25 


9 SQ S 


A 

ft . 


A 
• ft 


1 R9ft 


1 


MAirZ MUUbHj 


P20357 


mus musculu 


9 6 


9 ^ft ^ 

Z J O . D 


A 

4 - 


> fi 


9 ft n r 
z o uo 


1 


> jf 7\ "D 7\ tJT T"NjT7\ XT 

MAFA HUMAN 


P78559 


homo sapien 


97 


9 R Q 


4 . 


/I 


1 o 1 b 


1 


PzUO MYCGE 


Q49429 


mycoplasma 


9 ft 


9^7 R 


fi . 


Q 

. O 


1 / O 1 


1 


AK 1 z HUMAN 


Q02952 


homo sapien 


2 9 


9 S7 R 


A 

ft • 




9 7 ^ ft 

Z / -J O 


1 


JrbUV KAl 


Q9erb4 


rattus norv 


-J u 


Z Oo 


fi . 




1 1 on 

Hoy 


1 


YJnb YrlAST 


P47035 


saccharomyc 




oto 
Z OZ 


A 

fi . 


o 

sJ 


/l Q "7 7 
flo / / 


1 


ANKo HUMAN 


Q12955 


homo sapien 


17 


i J U » J 


/I 

ft . 


9 
Z 


1 ft 61 


1 


"M"A "D O n a rn 

MArZ KAl 


P15146 


rattus norv 




9 4 Q R 
Z fi . D 


A 

ft . 


Z 


i dot 
1 oZ / 


1 


MAPz HUMAN 


P11137 


homo sapien 


^ 4 

O fi 


9 A Q 

z 4 o 


A 

4 . 


z 


o4Z 1 


1 


TEGU HSVEB 


P28955 


equine herp 




0 A A 
Z fi fl 


/I 

4 • 


"1 


loOZ 


1 


BRD4 HUMAN 


060885 


homo sapien 


O D 


9 A 9 

Z 4Z 


A 

4 . 


1 


i t o n 

i / y o 


1 
1 


US01 YELAST 


P25386 


saccharomyc 


^7 


z o y 


4 . 


U 


1/44 


1 


TANA XENLA 


Q01550 


xenopus lae 


^ft 


9 ^ ft ^ 


A 

ft . 


n 
U 


1 A T 1 
1411 


1 


TCOF HUMAN 


Q13428 


homo sapien 




9^6 R 


A 

fi • 


u 




1 


rbLV MOUSE 


Q62059 


mus musculu 


40 


235.5 


4 . 


o 


1723 


1 


ATM1 HTIMAM 


/-\Q, r /1 1,-1 

y y y4 ki 


homo sapien 


41 


233.5 


3. 


9 


1087 


1 


NFH_M0USE 


P19246 


mus musculu 


42 


232 


3. 


9 


1140 


1 


YM96_YEAST 


Q04893 


saccharomyc 


43 


231 


3. 


9 


3256 


1 


KI67_HUMAN 


P46013 


homo sapien 


44 


229.5 


3. 


9 


1367 


1 


AM YH_ YEAST 


P08640 


saccharomyc 


45 


228.5 


3. 


9 


1306 


1 


MSB2_YEAST 


P32334 


saccharomyc 



ALIGNMENTS 



RESULT 1 
RTN4_HUMAN 

ID RTN4_HUMAN STANDARD; PRT; 1192 AA. 

AC Q9NQC3; 094962; Q9BXG5; Q9H212; Q9H3I3; Q9UQ42; Q9Y293; Q9Y2Y7; 

AC Q9Y5U6; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Neuroendocrine-specific protein) (NSP) (Neuroendocrine specific 

DE protein C homolog) (RTN-x) (Reticulon 5) (My043 protein) . 

GN RTN4 OR NOGO OR ASY OR KIAA08 86. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE=20129242; PubMed=106677 8 0 ; 

RA Prinjha R. , Moore S.E., Vinson M. , Blake S., Morrow R. , Christie G. , 

RA Michalovich D. , Simmons D.L., Walsh F.S.; 

RT "Inhibitor of neurite outgrowth in humans."; 

RL Nature 403:383-384(2000). 

RN [2] 



RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2) . 

RC TISSUE=Brain; 

RX MEDLINE=2 101 0696; PubMed=11126360 ; 

RA Tagami S., Eguchi Y., Kinoshita M. , Takeda M. , Tsujimoto Y. ; 

RT "A novel protein, RTN-XS, interacts with both Bcl-XL and Bcl-2 on 

RT endoplasmic reticulum and reduces their anti-apoptotic activity. 1 '; 

RL Oncogene 19:5736-574 6(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE=20237542; PubMed=10773680 ; 

RA Yang J., Yu L., Bi A.D., Zhao S.-.Y.; 

RT "Assignment of the human reticulon 4 gene (RTN4) to chromosome 

RT 2pl4 — >2pl3 by radiation hybrid mapping."; 

RL Cytogenet. Cell Genet. 88:101-102(2000). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM 4). 

RA Jin W. -L. , Ju G. ; 

RT "Developmentally-regulated alternative splicing in a novel Nogo-A."; 

RL Submitted (NOV-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3) . 

RC TISSUE=Placenta, and Skeletal muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in human."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Fibroblast; 

RA Yutsudo M. ; 

RT "Isolation of a cell death-inducing gene."; 

RL Submitted (JUN-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RC TISSUE=Pituitary; 

RA Song H., Peng Y. , Zhou J., Huang Q. , Dai M. , Mao Y.M., Yu Y. , Xu X., 

RA Luo B., Hu R., Chen J. ; 

RT "Human neuroendocrine-specif ic protein C (NSP) homolog gene."; 

RL Submitted (JUL-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [8] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y. , 

RA Yu J., Han L.H. ; 

RT "Novel human cDNA clone with function of inhibiting cancer cell 

RT growth . " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [9] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain; 

RX MEDLINE=99156230; PubMed=1004 84 85 ; 

RA Nagase T . , Ishikawa K.-I., Suyama M., Kikuno R. , Hirosawa M. , 

RA Miyajima N., Tanaka A., Kotani H., Nomura N., Ohara o.; 

RT "Prediction of the coding sequences of unidentified human genes. XII. 

RT The complete sequences of 100 new cDNA clones from brain which code 

RT for large proteins in vitro."; 

RL DNA Res. 5:355-364(1998). 

RN [10] 



RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 3). 

RC TISSUE=Brain, Ovary, Pancreas, Placenta, and Skeletal muscle; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J.A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J. , Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 
RN [11] 

RP SEQUENCE FROM N.A. (ISOFORM 3) . 

RX MEDLINE=20499367; PubMed=11042152 ; 

RA Zhang Q.-H., Ye M. , Wu X.-Y., Ren S . -X . , Zhao M. , Zhao C.-J., Fu G. , 

RA Shen Y., Fan H.-Y., Lu G. , Zhong M. , Xu X.-R., Han Z.-G., Zhang J.-W., 

RA Tao J., Huang Q.-H., Zhou J., Hu G.-X., Gu J., Chen S.-J., Chen Z . ; 

RT "Cloning and functional analysis of cDNAs with open reading frames for 

RT 300 previously undefined genes expressed in CD34+ hematopoietic 

RT stem/progenitor cells."; 

RL Genome Res. 10:1546-1560(2000). 

RN [12] 

RP SEQUENCE OF 482-1192 FROM N.A. (ISOFORM 1/4). 

RC TISSUE=Brain; 

RA Mao Y.M., Xie Y., Zheng Z.H.; 

RL Submitted (MAY-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [13] 

RP SEQUENCE OF 186-1192 FROM N.A. (ISOFORM 1). 

RC TISSUE=Testis; 

RA Sha J.H., Zhou Z.M., Li J.M. ; 

RL Submitted (JAN-2001) to the EMBL/ GenBank/DDBJ databases. 

RN [14] 

RP TOPOLOGY. 

RC TISSUE=Brain; 

RX MEDLINE=20129259; PubMed=10667797 ; 

RA GrandPre T., Nakamura F. , Vartanian T. f Strittmatter S.M.; 

RT "Identification of the Nogo inhibitor of axon regeneration as a 

RT Reticulon protein."; 

RL Nature 403:439-444(2000). 

RN [15] 

RP FUNCTION. 

RC TISSUE=Brain; 

RX MEDLINE=21069055; PubMed=112 017 42 ; 

RA Fournier A.E., Grandpre T . , Strittmatter S.M.; 

RT "Identification of a receptor mediating Nogo-66 inhibition of axonal 



RT regeneration . " ; 

RL Nature 409:341-346(2001). 

RN [16] 

RP REVIEW. 

RX MEDLINE=21888956; PubMed=11891768 ; 

RA Ng C.E.L. , Tang B.L. ; 

RT "Nogos and the Nogo-66 receptor: factors inhibiting CNS neuron 

RT regeneration . " ; 

RL J. Neurosci. Res. 67:559-565(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults. 

cc Isoform 2 reduces the anti-apoptotic activity of Bcl-xl and Bcl-2. 

CC This is likely consecutive to their change in subcellular 

cc location, from the mitochondria to the endoplasmic reticulum, 

CC after binding and sequestration. 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum. Anchored to the membrane of the endoplasmic reticulum 

CC through 2 putative transmembrane domains. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=l; Synonyms=RTN 4A, Nogo-A, RTN-xL; 

CC IsoId=Q9NQC3-l; Sequence=Displayed; 

CC Name=2; Synonyms=RTN 4B, Nogo-B, RTN-xS, Foocen-M; 

CC IsoId=Q9NQC3-2; Sequence=VSP_005655 ; 

CC Name=3; Synonyms=RTN 4C, Nogo-C, Foocen-S; 

cc IsoId=Q9NQC3-3; Sequence=VSP_005652 , VSP 005653; 

CC Name=4 ; 

CC IsoId=Q9NQC3-4; Sequence=VSP_005654 ; 

CC -!- TISSUE SPECIFICITY: Isoform 1 is specifically expressed in brain 

CC and testis and weakly in heart and skeletal muscle. Isoform 2 is 

CC widely expressed excepted for the liver. Isoform 3 is expressed in 

CC brain, skeletal muscle and adipocytes. Isoform 4 is testis- 

CC specific. 

CC SIMILARITY: Contains 1 reticulon domain. 

CC -!- CAUTION: Ref.ll sequence differs from that shown due to 

CC frameshifts in positions 1149 and 1156. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AJ251383; CAB99248.1; -. 

DR EMBL; AJ251384; CAB99249.1; -. 

DR EMBL; AJ251385; CAB99250.1; -. 

DR EMBL; AB040462; BAB18927.1; -. 

DR EMBL; AB040463; BAB18928.1; -. 

DR EMBL; AF148537; AAG12176.1; 

DR EMBL; AF148538; AAG12 177.1; 

DR EMBL; AF087901; AAG12205.1; 

DR EMBL; AF320999; AAG40878.1; 

DR EMBL; AF132047; AAD31021.1; -. 

DR EMBL; AF132048; AAD31022.1; -. 



DR 


EMBL; 


AB015 


DR 


EMBL; 


AF077 


DR 


EMBL; 


AF177 


DR 


EMBL; 


AB020 


DR 


EMBL; 


BC001 


DR 


EMBL; 


BC007 


DR 


EMBL; 


BC014 



639; BAA83712.1; - 

050; AAD27783.1; - 

332; AAG17976. 1; - 

693; BAA74909.1; - 

035; AAH01035.1; - 

109; AAH07109.1; - 

366; AAH14366.1; - 



Query Match 98.2%; Score 5815; DB 1; Length 1192; 

Best Local Similarity 97.2%; Pred. No. 1.8e-215; 

Matches 1160; Conservative 4; Mismatches 13; Indels 16; Gaps 3 

QY 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 

M I M I I I I I I I I I I II M I I I I II I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | M | | 
Db 1 MEDLDQSPLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPA 60 



QY 61 AGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVPAP 12 0 

M II I I I I I I I II II I I I I I I I I I I I I I I I I I M | | | | | | | | | | | M I I I I I II I 

Db 61 AGL S AAP VP T AP AAGAPLMD FGND FVP PAP RGP L P AAP P VAP E RQ P S WD P S P VS S T VPAP 12 0 

Qy 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

M I I I I I I I I II I I I M I I I I II I I I II I I I I I I I I I I I II I I M I I I I I I I I I I I I M | 
Db 121 SPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKRRG 180 

Qy 181 SSGA WXXXXKIMDLKEQPGNTI SAGQEDFPSVLLETAASXPSLS P 226 

HI' I : : I I II I I I I I I I I II I I I I I I II I I II I I II I I I 

Db 181 SSGSVDETLFALPAASEPVIRSSAENMDLKEQPGNTISAGQEDFPSVLLETAASLPSLSP 240 

Qy 227 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 286 

M I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I | 
Db 241 LSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEM 300 

Qy 287 GS S FSVS PKAES AVI VANPREEI I VKNKDEEEKLVSNNI LHXQQELPTALTKLVKEDEVV 346 

I I I I I I I I I I I I II I I M I I I I I I II I I II II I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 301 GSS FSVS PKAES AVI VANPREEI I VKNKDEEEKLVSNNI LHNQQELPTALTKLVKEDEW 360 

Qy 347 S S EKAKDS FNEKRVAVEAPMREEYADFKP FERVWEVKD S KEDS DMLAAGGKI ESNLES KV 406 

I I M I I I I I M I I I I I I I I II I I I I I II I I I I I I I I II II I I I || | | | | | | | | | | | | | | | 
Db 361 S S EKAKDS FNEKRVAVE1APMREEYADFKP FERVWEVKD S KEDS DMLAAGGKI ESNLES KV 420 

Qy 407 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 466 

I I I I I I I I I I I I I I I I I I I II I I II I I I I II I I I I I I I I I I I II I I I II I I M II I I I I I 
Db 421 DKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIF 480 



Qy 4 67 PLLEDPTSENXTDEKKIEEKKAQIWEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTE 52 6 

Ml I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I I I I I I I I I 

Db 481 PLLGDPTSENKTDEKKIEEKKAQIVTEKNTSTKTSNPFLVAAQDSETDYVTTDNLTKVTE 54 0 

Qy 527 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 586 

I I I M I M I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II 

Db 541 EWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSF 600 

Qy 587 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 646 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 601 EESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHEPENPPPYE 660 



Qy 



647 EAMSVSL-KVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSDYSE 705 



Db 


661 


Qy 


706 


Db 


721 


Qy 


766 


Db 


781 


Qy 


826 


Db 


841 


Qy 


886 


Db 


901 


Qy 


946 


Db 


960 


Qy 


1006 


Db 


1020 


Qy 


1066 


Db 


1080 


Qy 


1126 


Db 


1140 



I I I I I I I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 



N I I I I I M I I I II I I I I I I I I I I I I I I I I M I I I I II II I I I I I I I I I II I I II 



780 



EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 825 
M I I I I I I I I I I I I I I I I I I II I M I II I I I I I I II I I I I I I I I I II I I I I I I I I I I I II 
EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVY 84 0 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I | | | 



885 
900 



ANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGH 945 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I M I I I | | 



959 



I I I I I I I I M I I I I I I M I I I I I I I I I I II I I I I I | | | : | || | | | | | | | | | | | | | | | 



I M I I I I I I I I I I I I I M M I M I I II I I I II I I I I I I I I I || I I I I I I I I M | | | M II 



EVAISEELVQKYSNSALGHWCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1125 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I M I I I I | | | 
EVAI S EELVQKYSNSALGHVNCTI KELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLT 1139 



I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M | | | | | | 



RESULT 2 
RTN4_RAT 

ID RTN4_RAT STANDARD; PRT; 1163 AA. 

AC Q9JK11; Q9JK10; Q9R0D9; Q9WUE9; Q9WUF0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) (Foocen) 

DE (Glut4 vesicle 20 kDa protein) . 

GN RTN4 OR NOGO. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND PARTIAL SEQUENCE. 

RC STRAIN=Sprague-Dawley; TISSUE=Adipocyte ; 

RX MEDLINE=99249816; PubMed=10231557 ; 

RA Morris N.J., Ross S.A., Neveu J.M. , Lane W.S., Lienhard G.E.; 

RT "Cloning and characterization of a 22 kDa protein from rat adipocytes: 

RT a new member of the reticulon family."; 



RL Biochim. Biophys. Acta 1450:68-76(1999). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORMS 1; 2 AND 3). 

RX MEDLINE=20129258; PubMed=1066779 6 ; 

RA Chen M.S., Ruber A.B., Van der Haar M.E., Frank M. , Schnell L., 

RA Spillmann A.A., Christ F., Schwab M.E.; 

RT "Nogo-A is a myelin-associated neurite outgrowth inhibitor and an 

RT antigen for monoclonal antibody IN-1."; 

RL Nature 4 03:434-439(2000). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS 2 AND 4). 

RC STRAIN=Wistar Kyoto; TISSUE=Vascular smooth muscle; 

RA Ito T., Schwartz S.M.; 

RT "Cloning of a member of the reticulon gene family in rat: one of two 

RT minor splice variants."; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP FUNCTION. 

RX MEDLINE=22033691; PubMed=12 037567 ; 

RA GrandPre T., Li S., Strittmatter S.M.; 

RT "Nogo-66 receptor antagonist peptide promotes axonal regeneration."; 

RL Nature 417:547-551(2002). 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 

CC block the regeneration of the nervous central system in adults (By 

CC similarity) . 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 

CC membrane of the endoplasmic reticulum through 2 putative 

CC transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=l; Synonyms=Nogo-A, NI-220-250; 

cc IsoId=Q9JKll-l; Sequence=Displayed; 

CC Name=2; Synonyms=Nogo-B, Foocen-Ml; 

CC IsoId=Q9JKll-2; Sequence=VSP_005658 ; 

CC Name=3; Synonyms=Nogo-C, VP20; 

CC IsoId=Q9JKll-3; Sequence=VSP_005656, VSP_005657; 

CC Name=4; Synonyms=Foocen-M2 ; 

CC IsoId=Q9JKll-4; Sequence=VSP_005659 ; 

CC -!- TISSUE SPECIFICITY: Isoforms l f 2 and 3 are present in optic 

CC nerve, spinal cord and cerebral cortex. Isoforms 1 and 2 are 

CC present in dorsal root ganglion, sciatic nerve and PC12 cells 

CC after longer exposure. Isoforms 2 and 3 are detected in kidney, 

CC cartilage, skin, lung and spleen. Isoform 3 is expressed at high 

CC level in skeletal muscle. In adult animals isoform 1 is expressed 

CC mainly in the nervous system. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 



DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 



EMBL; AF051335; AAF01564.1; 
EMBL; AJ242961; CAB71027.1; 
EMBL; AJ242962; CAB71028.1; 
EMBL; AJ242963; CAB71029.1; 
EMBL; AF132045; AAD31019.1; 
EMBL ; AF132046; AAD31 020.1; 
GO; GO:0030176; C:integral to endoplasmic reticulum membrane; IDA. 
GO; GO: 0005635; Crnuclear membrane; ISS. 
GO; GO: 0005515; Fcprotein binding; ISS. 

GO; GO:0019987; Prnegative regulation of anti-apoptosis ; ISS. 
GO; GO: 0030517; Prnegative regulation of axon extension; ISS 
InterPro; IPR003388; Reticulon. 
Pfam; PF02453; Reticulon; 1. 
PROSITE; PS50845; RETICULON; 1. 

Endoplasmic reticulum; Alternative splicing; Transmembrane. 



DOMAIN 
TRANSMEM 
DOMAIN 
TRANSMEM 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
VARSPLIC 

VARSPLIC 



VARSPLIC 

VARSPLIC 

CONFLICT 
SEQUENCE 



990 
1011 
1105 
1126 
976 
33 
73 
140 



989 
1010 
1104 
1125 
1163 
1163 
46 
76 
145 
964 



CYTOPLASMIC (Potential) 
POTENTIAL. 

LUMENAL (Potential) . 
POTENTIAL. 



CYTOPLASMIC (Potential). 
RETICULON. 
POLY-GLU. 
POLY-ALA. 
POLY- PRO. 

Missing (in isoform 3) . 
/FTId=VSP_005656. 

AVLSAELSKTS -> MDGQKKHWKDK (in isoform 
3) . 

/FTId=VSP_005657. 
Missing (in isoform 2) . 
/FTId=VSP_005658. 
Missing (in isoform 4) . 
/FTId=VSP_005659. 
1130 1131 MISSING (IN REF. 3; AAD31020) . 

1163 AA; 126386 MW; 8CB894B09E94F0B6 CRC64; 



965 



173 



192 



975 



975 



975 



Query Match 72.5%; Score 4296.5; DB1; 

Best Local Similarity 74.0%; Pred. No. 2e-157; 
Matches 885; Conservative 104; Mismatches 156; Indels 



Length 1163; 

51; Gaps 



19; 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



1 MEDLDQSPLVSSS-DSPPRPQPAFKYQFVREPEDEE-EEEEEEEEDEDEDLEELEVLERK 58 

n 111:111 Mill MUM MINIM I f I I I f M I M M I : M II I M M M I M 

1 MEDIDQSSLVSSSTDSPPRPPPAFKYQFVTEPEDEEDEEEEEDEEEDDEDLEELEVLERK 60 

59 P AAG L S AAP VP T AP AAGAP LMD F GN D FVP PAP RG P L P AAP P VAP E RQ P S W D P S P VS S T VP 118 

IIMIMI II I I I I M :\ I :| I || I II II I M M I I I II M II : M : I 
61 PAAGLSAAAVP- - P AAAAPLLDFS SDSVPPAPRGPLPAAPPAAPERQPSWERS PAA P 115 

119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAPAPAAPPSTPAAPKR 178 

_ III I II I | M I I I I II II I M II M II II II | M I I I I I I I I I I 

116 APSLPPAAAVLPSKLPEDDEPPARPPPPPPAGASPLAE PAAPPSTPAAPKR 166 

179 RGSSG AWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSL 224 

' ' ' : I I I I I I I M I I : I M M I I II I II I I II I II I 

167 RGSGSVDETLFALPAASEPVIPSSAEKIMDLMEQPGNTVSSGQEDFPSVLLETAASLPSL 226 

225 SPLSAAS EKEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYS 284 



II LilllLll III LL _ ll 1 1 : : 1 : : I M I I : I : I : : | | | | M I I I , I 

ELEYS 28 6 



227 SPLSTVSFKEHGYLGNLSAVSSSEGTIEETLNEASKELPERATNPFVNRDLAEFS 



285 EMGSSFSVSPKAESAVIYANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDE 344 
I I I I I I I I I I I I : : I I : I I : | | : : | | : | : | | : | | | | | , , , 

287 EMGSSFKGSPKGESAILVENTKEEVIVRSKDKED-LVCSAALHSPQESPVG KEDR 340 

345 WSSEKAKDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDS-DMLAAGGKIESNLE 4 03 

0jl , 111 11 1 111 I I •' I I I I I I I I I | | : HIM: | | |:||| : | : , 

341 WSPEKTMDIFNEMQMSWAPVREEYADFKPFEQAWEVKDTYEGSRDVLAA RANVE 396 

4 04 SKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIAT 463 

oo I I I I : I I Mill: UN I : I I II I I I I : M | II I I I I I : I I I I 

397 SKVDRKCLEDSLEQKSLGKDSEGRNEDASFPSTPEPVKDSSRAYITCASFT-SATESTTA 455 

4 64 NIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

„ cc 1 111111 1111 II I MM 1:11 I |:| | | || I | | | | | M INI Ml I II |:| 

456 NTFPLLEDHTSENKTDEKKIEERKAQIITEK-TSPKTSNPFLVAVQDSEADYVTTDTLSK 514 



524 
515 



VTEEWANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEYMQESLYPAAQLC 583 

111 1 : ' ' I ' I ' I M M I I I I I I M I I I I I M : I II II I I :IMIII MM 

VTEAAVSNMPEGLTPDLVQEACESELNEATGTKIAYETKVDLVQTSEAIQESLYPTAQLC 



574 



575 



584 PSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEA-SSVNYESIKHEPENP 642 
1 ' 1 1 ' : 1 1 I I I II I I I I I I I I I I I | :|||||||:|M Mill |:|:||l Mill 
PSFEEAEATPSPVLPDIVMEAPLNSLLPSAGASWQPSVSPLEAPPPVSYDSIKLEPENP 634 

643 PPYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPAPDFSD 702 

coc 1 I I I I I I : I : II | M Mill: M I : I I II II I || | | | || | I II I I I I I : II II : 

635 PPYEEAMNVALKALGTKEGIKEPESFNAAVQETEAPYISIACDLIKETKLSTEPSPDFSN 694 

703 YSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVMLVKESLTETSFE 762 

cnc 111:11 1 : I I : I : I I I I I | | | : II I I I M I : II I I : I M I : I I II II I | 

695 YSEIAKFEKSVPEHAELVEDSSPESEPVDLFSDDSIPEVPQTQEEAVMLMKESLTEVS-E 753 

763 SMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELST 822 

!: ::: :|!|M I I I I I M I I I :: I : || | : : : II : I M I I I I M I : I 

754 TVAQHK-EERLSASPQELGKPYLESFQPNLHSTKDA-ASNDIPTLTKKEKISLQMEEFNT 811 

823 AVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHK 882 

on „ l ! IMIN Ml :l:|:|IIIIIIMMIMIII :|:| || Ml: Ill I 

812 AIYSNDDLLSSKEDKIKESETFSDSSPIEIIDEFPTFVSAKDDS-PKLAKEYTDLEVSDK 870 

883 SEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSA 942 

'HI II MM Ml III III || ::: ||:||:| | : || : | 
871 SEIANIQSGADSLPCLELPCDLSFKNIYPK — DEVHVSDEFSENRSSVSKASISPSNVSA 928 

943 LGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKK 1002 

„ ' I I : N I I I I II M I M M II II II II | I : || : | | | || | || M I II I M 

929 L-EPQTEMGSIVKSKSLTKEAEKKLPSDTEKEDRSLSAVLSAELSKTSWDLLYWRDIKK 987 

1003 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1062 
I N II I II II II II II II I II II M II II I || M M II II II M II I I II I II I II II II 
988 TGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAY 1047 

1063 LESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1122 
111 I M I II M II II II || |M I I || || M I II II II I II II I II II II 



Db 



1048 LESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFN 1107 



Qy 1123 GLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGLKRKAE 1178 

Db .... ''li'lilil! 1 11:11 111111 IIIMIIIIII: IIIIIIMIIIIII: 

Vt> 1108 GLTLLILALISLFSIPVIYERHQVQIDHYLGLANKSVKDAMAKIQAKIPGLKRKAD 1163 



RESULT 3 
RTN4_MOUSE 

ID RTN4_MOUSE STANDARD; PRT; 199 AA 

AC Q99P72; Q9CTE3; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Reticulon 4 (Neurite outgrowth inhibitor) (Nogo protein) 

GN RTN4 OR NOGO. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OX ^^^^9^"' ROdSntia; SClUr0gnathi; M -^ae; Mus. 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=3T3-L1; TISSUE=Adipocyte; 

RA Coulson A.C., Craggs P.D., Morris N.J.; 

RT "Mouse vp20/RTN4C cDNA. "; 

RL Submitted (DEC-2000) to the EMBL/ GenBank/DDBJ databases 
RN [2] " 
RP SEQUENCE OF 170-199 FROM N.A. 
RC STRAIN=C57BL/6J; TISSUE=Embryo; 

RX MEDLINE=21085660; PubMed=11217851 ; 

RA Kawai J., Shinagawa A., Shibata K. , Yoshino M. , Itoh M. , Ishii Y 

RA Arakawa T . , Hara A., Fukunishi Y. , Konno H., Adachi J., Fukuda s" 

RA Aizawa K., Izawa M. , Nishi K., Kiyosawa H., Kondo S., Yamanaka I.'' 

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R. , 

RA Kadota K., Matsuda H.A., Ashburner M. , Batalov S., Casavant T 

RA Fleischmann W., Gaasterland T., Gissi C, King B., Kochiwa H 

fh h - f'; ^U'l^ 3 ™ Y " Nikaido Pesole G. , Quackenbush J., 

RA Schnml L.M., Staubli F. , Suzuki R. , Tomita M., Wagner L., Washio T. 

RA Sakai K. , Okido T., Furuno M. , Aono H., Baldarelli R., Barsh G 

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M. F ' 

RA Brownstein M.J., Bult C, Fletcher C, Fujita M. , Gariboldi M " 

RA Gustincich S., Hill D., Hofmann M. , Hume D.A., Kamiya M. , Lee N H 

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P 

RA Nordone P., Ring B., Ringwald M. , Rodriguez I., Sakamoto N 

RA Sasaki H., Sato K., Schoenbach C, Seya T., Shibata Y. , Storch K -F 

RA Suzuki H., Toyo-oka K. , Wang K.H., Weitz C. , Whittaker C, Wilming L. , 

RA Wynshaw-Bons A., Yoshida K. , Hasegawa Y. , Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection "■ 

RL Nature 409:685-690(2001) . 

CC -!- FUNCTION: Potent neurite outgrowth inhibitor which may also help 
CC block the regeneration of the nervous central system in adults (By 

CC similarity) . * 

CC -!- SUBUNIT: Binds to RTN4R. Interacts with Bcl-xl and Bcl-2 (Bv 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Anchored to the 
CC membrane of the endoplasmic reticulum through 2 putative 



cc transmembrane domains (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=l; 

CC Comment=A number of isoforms may be produced; 

CC Name=l; 

CC IsoId=Q99P72-l; Sequence=Displayed; 

CC -»- SIMILARITY: Contains 1 reticulon domain. 



CC 
CC 
CC 



letLtr, T- 7 " COpysl * ht - Tt 13 Produced through a collaboration 

between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
cr US H-^ n T P K° institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 
CC "^ff 8 a license agreement (See http://www.isb-sib.ch/announce/ 

or se nd an email to license@isb-sib . ch) 
CC _'_ 

DR EMBL; AF326337; AAK08076.1; -. 

DR EMBL; AK003859; -; NOT_ANN0TATED CDS . 

DR MGD; MGI: 1915835; Rtn4 . 

DR GO; GO: 0005783; C: endoplasmic reticulum; IDA. 

np ™ SS S 2Jnc«c ; C:inte 9 ral to endoplasmic reticulum membrane; ISS. 

DR GO; GO: 0005635; C:nuclear membrane; ISS. 

DR GO; GO: 0005515; F:protein binding; ISS. 

DR GO; GO: 0019987; P: negative regulation of anti-apoptosis ; ISS. 

DR GO; GO: 0030517; P:negative regulation of axon extension; ISS. 

DR GO; GO: 0007399; P: neurogenesis; IDA. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT DOMAIN 1 25 CYTOPLASMIC (Potential). 

FT TRANSMEM 26 50 POTENTIAL 

FT DOMAIN 51 137 LUMENAL (Potential). 

FT TRANSMEM 138 162 POTENTIAL 

FT DOMAIN 163 199 CYTOPLASMIC (Potential). 

FT DOMAIN 12 199 RETICULON. 

SQ SEQUENCE 199 AA; 22466 MW; 07BE5D580059ED9C CRC64; 

Query Match 15.4%; Score 915; DB1; Length 199; 

Best Local Similarity 97.4%; Pred. No. 4.9e-29- 

Matches 186; Conservative 2; Mismatches 3; Indels 0; Gaps 0; 
Qy 988 
Db 9 



Qy 1048 
Db 69 



KTS WDLLYWRDI KKTGWFGAS LFLLLSLT VFS I VSVTAYI ALALLS VT I S FRIYKGVI 1047 

1 I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I I M I I I I I I I I I M I I 

KDKWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 68 

QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

1 1 1 1 1 1 1 11 < I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNSTIKELRRLFLVDDLVDSLKF 128 

QY 1108 AVLMWVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 

100 Ill II I I II I I Ml ll:| Ml MM MM 11111111:111 IMIII 

Db 129 AVLMWVFTYVGALFNGLTLLILALISLFSIPVIYERHQAQIDHYLGLANKSVKDAMAKIQ 188 

Qy 1168 AKI PGLKRKAE 1178 

I I II I I I II I I 
Db 189 AKI PGLKRKAE 199 



RESULT 4 
RTN1_RAT 

ID RTN1_RAT STANDARD; PRT; 777 AA 

AC Q64548; Q64547; 

DT 16-OCT-2001 (Rel. 40, Created) 
DT 16-OCT-2001 (Rel. 40, Last sequence update) 
DT 10-OCT-2003 (Rel. 42, Last annotation update) 
DE Reticulon 1 (Neuroendocrine-specif ic protein) (S-rex) 
GN RTN1 OR NSP. 1 ' 

OS Rattus norvegicus (Rat) . 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 
M^ a i"!;n^" ia; R ° dentia '* 3ciurognathi; Muridae; Murinae; Rattus. 



OC 
OC 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTNl-B AND RTN1-S) 

RC STRAIN=Wistar; TISSUE=Brain cortex; 

RX MEDLINE=96386034; PubMed=8793864 ; 

RA Baka I.D., Ninkina N.N., Pinon L.G.P., Adu J., Davies AM 

RA Georgiev G.P/, Buchman V.L.; * 

pt " In ^^ el i^ ar . com P artraen talization of two differentially spliced s- 

RT rex/NSP mRNAs in neurons."; 

RL Mol. Cell. Neurosci. 7:289-303(1996). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane (By 

CC similarity) . 

CC ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTNl-B ; Synonyms=S-RexB; 

cc IsoId=Q64548-l; Sequence=Displayed; 

CC Name=RTNl-S ; Synonyms=S-RexS ; 

CC IsoId=Q64548-2; Sequence=VSP_005647 , VSP 005648- 

CC -!- TISSUE SPECIFICITY: EXPRESSED PREDOMINANTLY IN CENTRAL AND 

CC PERIPHERAL NERVOUS SYSTEM OF NEWBORN AND ADULT RATS. LOW LEVELS 

CC HAVE BEEN ALSO DETECTED IN HEART, ADRENAL GLAND AND SPLEEN 

CC EXPRESSION OF ISOFORM RTNl-B IS RESTRICTED TO PARTICULAR NEURONAL 

CC TYPES • 

CC -!- DEVELOPMENTAL STAGE: DETECTED ON EMBRYONIC DAY E10 IN THE 

CC HINDBRAIN AND IN Ell IN THE FOREBRAIN. DURING THE NEXT 3 EMBRYONIC 

CC DAYS THE LEVELS OF S-REXS INCREASES AND REMAINS STABLE AT E13 IN 

CC THE HINDBRAIN AND AT El 4 IN THE FOREBRAIN. THE LEVELS OF S-REXB 

CC DOES NOT CHANGE AS SIGNIFICANTLY DURING DEVELOPMENT OF THE 

CC HINDBRAIN. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is^rod^^through^l^l^bo^tion 

CC between the Sw 1SS Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

cr ™h-^ Y h "T^ institut i°ns as long as its content is in no way 

CC ^od lfl ed and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; U17604; AAC53046.1; -. 

DR EMBL; U17603; AAC53045.1; -. 



DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane. 

FT TRANSMEM 604 624 POTENTIAL. 

FT TRANSMEM 727 747 POTENTIAL 

FT DOMAIN 590 777 RETICULON. 

FT DOMAIN 610 613 POLY-LEU. 

FT VARSPLIC 1 569 Missing (in isoform RTN1-S) . 

FT /FTId=VSP_005647. 

IPGPLGSDLVPPLPFFNKQK -> MQATADSTKMDCVWSNW 
KSQ (in isoform RTN1-S) . 
FT /FTId=VSP_005648. 

SQ SEQUENCE 777 AA; 83001 MW; AF7479C50F28D0AC CRC64; 



FT VARSPLIC 570 589 
FT 



Query Match 13.4%; Score 791; DB 1 ; Length 777; 

Best Local Similarity 26.6%; Pred. No. 1.4e-23; 

Matches 290; Conservative 129; Mismatches 308; Indels 364; Gaps 39 

Qy 134 PEDDEPPARPPPPPPASVSPQAE-PVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKI 192 

I ■ : I I I : I II || || I : | | 
Db 5 PDLQDEPLSPANPGSQLFGGRGEGEEATPKGARPAQQDGEPAWGS — GAGAGWS 57 

Qy 193 MDLKEQPGNTISAGQEDFPSVLLETA ASXP-SLSPLSAASFKEHE YLGNLSTV 244 

: : 1 11:111 I : I : I I : : | : | | : | : 

Dt> 58 SRGLCSGPARSPPVAMETASTGVAAVPDALDHSSSPTLKDGEGACYTSLISDI 110 

QY 245 L — PTE GTLQE NVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFS 291 

II III: I I : : I : : | : | | | | : | 

Db 111 CYPPREDSAYFTGILQKENGHITTSESPEELGTPGPS-LPEVPGTEPHGLLSSDSGIEMT 169 

Qy 292 VSPKAESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWSSEKA 351 

I I • I • I ' « 

Db 170 PAESTEVNKILADPLDQ 186 

Qy 352 KDSFNEKRVAVEAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVDKKCF 411 

I: I : I I I :| I I 
Db 187 MKAEACKYI DITRPQEAKGQEEQS PGL 213 

Qy 412 ADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATES— IATNIFPLL 4 69 

: I I : : I : : I I I : I I I : | I | : 

Db 214 EDKDLDFKDKDSEVSTKPEGVH APNQPSPVEGKLIKDNLF— 253 

Qy 470 EDPTSENXTDEKKIEEKKAQIVTE— KNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

1=1 H I : : : I | ll|: | | : : 
Db 25 4 EESTFAPYIDELSDEQHRMSLVTAPVKITLTEIGPPVMTATHET 297 

Qy 52 8 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFE 587 

: 1 1 Ml | | : | : 

Db 298 IPE KQDL CLKPSPDTVPTVT 317 

Qy 588 ESEA TPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSP LEAS S VN YES I KH 637 

11 =111 I : I I I I : | | | | :: | | : 
Db 318 VSEPEDDSPGSVTP PSSGTEPSAAESQGKGSVSEDELIAAI KEAKGLSYET 368 

Qy 638 EPENPPPYEEAMSVSLKVSGIKEEIKE PENINAALQETEAPYISIACDLIKET 690 

1:1 1 :| : ::l I ::: |: | :|: | 



Db 369 -TESPRPVGQAAD RPKVKARSGLPTIPSSLDQEASSAESGDSEI — ELVSED 417 

Qy 691 KLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFSDDSIPDVPQKQDETVM 750 

I M III I I | II : • . 

Db 418 PMASEDALP-SGYVSFGHVSGPPP SPAS PSIQYSILREEREAE 459 

QY 751 LVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK 810 

1 I : I : I Is:: II II: | | • • | | . . . 

Db 460 LDSELIIESCDAS SASEESPKREQDSPPM--KPGV LD AIREETSSRATE 506 

QY 811 EKIPLQMEELSTAVYSNDDLFISKEAQIRETETFSDSSPIE IIDEFPTLISSKTDSF 867 

I : I I I : I I : I : | : : 

Db 507 ERAP SHQGPVEPDPILSFTPVTLQSRPE — 534 

QY 8 68 SKLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNG 927 

Db s« ' 111 I: I I II: I |:: 

535 PSSGDGAPVPEPP KSQQQKPEEEAVSS— SQSP 565 

Qy 928 SATSKVLLLPPDVSALGHTQAEI ES I VKPKVLEKEAEKKLPS DTEKEDRS PSAI FSADLG 987 

: : I : I | : 

Db 566 AAT EIPGPLGSDLVPPLPFFN 586 

Qy 988 KTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVI 1047 

co „ 1 : I I I M I II I : I I : I I I : I II I I I I i : I I I | | s | | | M | | M | | i | . . 
587 KQKAI DLLYWRDI KQTGI VFGS FLLLLFSLTQFS WSWAYLALAALSATI S FRI YKSVL 646 

Qy 1048 QAIQKSDEGHPFRAYLESEVAISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKF 1107 

I I : I I : I I I I I I : I I I I I : : I : | : | | | : : : | | | : || I I M I I I I I I I I I I I 
Db 647 QAVQKTDEGHPFKAYLELEITLSQEQIQKYTDCLQLYVNSTLKELRRLFLVQDLVDSLKF 706 

1108 AVIMflVFTYVGALFNGLTLLILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQ 1167 
, 1 1 1 1 1 = I I I I I I I I I I I I I : : I : : | : | : : M : I : I I I I : | I | : : : | | | I 

7 07 AVLMWLLTYVGALFNGLTLLLMAWSMFTLPWYVKHQAQVDQYLGLVRTHINTWAKIQ 766 



Db 



Qy 

Db 



Qy 1168 AKI PGLKRKAE 1178 

I I I I I II I I 
Db 767 AKIPGAKRHAE 777 

RESULT 5 
RTN1_HUMAN 

ID RTN1_HUMAN STANDARD; PRT; 77 6 AA 

AC Q16799; Q16800; Q16801; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon 1 (Neuroendocrine-specif ic protein) 

GN RTN1 OR NSP. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS RTN1-A; RTN1-B AND RTN1-C) 

RC TISSUE=Lung carcinoma; 

RX MEDLINE=93293865; PubMed=7685762 ; 

RA Roebroek A.J.M., Van de Velde H.J.K., Van Bokhoven A., Broers J.L.V., 



RA 
RT 
RT 



RA Ramaekers F.C.S., Van de Ven W.J.M.; 

RT "Cloning and expression of alternative transcripts of a novel 

RT neuroendocrine-specific gene and identification of its 135-kDa 

RT translational product."; 

RL J. Biol. Chem. 268:13439-134 47(1993) 

RN [2] 

RP ALTERNATIVE SPLICING. 

RX MEDLINE=96429995; PubMed=8833145 ; 

RA Roebroek A.J.M., Ayoubi T.A.Y., Van de Velde H.J.K. 

Schoenmakers E.F.P.M., Pauli I.G.L., Van de Ven W.j!m ; 

"Genomic organization of the human NSP gene, prototype of a novel gene 
family encoding reticulons . " ; 

RL Genomics 32:191-199(1996). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX MEDLINE=98228245; PubMed=95604 66 ; 

RA Hens J., Nuydens R. , Geerts H., Senden N.H., Van de Ven W J M 

RA Roebroek A. J., van de Velde H.J.K., Ramaekers F.C., Broers J.l!; 

RT "Neuronal differentiation is accompanied by NSP-C expression "; ' 

RL Cell Tissue Res. 292:229-237(1998). 

CC -!- FUNCTION: May be involved in neuroendocrine secretion or in 

CC membrane trafficking in neuroendocrine cells. 

CC -!- SUBCELLULAR LOCATION: Endoplasmic reticulum membrane. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event-Alternative splicing; Named isoforms=3; 

CC Name=RTNl-A; Synonyms=NSP-A; 

cc IsoId=Q16799-l; Sequence=Displayed; 

CC Name=RTN 1 - B ; Synonyms=NSP-B; 

cc IsoId=Q16799-2; Sequence-VSP_005644 ; 

CC Name=RTN 1 - C ; Synonyms =NSP-C; 

CC IsoId=Q16799-3; Sequence=VSP_005645, VSP 005646; 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN NEURAL AND NEUROENDOCRINE TISSUES 

CC AND CELL CULTURES DERIVED THEREFROM. EXPRESSION OF ISOFORM RTN1-C 

CC IS STRONGLY CORRELATED WITH NEURONAL DIFFERENTIATION. 

CC -!- PTM: Isoforms RTNl-A and RTN1-B are phosphorylated 

CC SIMILARITY: Contains 1 reticulon domain. 



CC 



DR 
DR 
DR 
DR 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L10333; AAA59950.1; 
DR EMBL; L10334; AAA59951.1; 
DR EMBL; L10335; AAA59952.1; 
DR PIR; A46583; A46583. 
DR PIR; 160904; 160904. 
DR Genew; HGNC: 10467; RTN1. 
DR MIM; 600865; 

GO; GO:0030176; C:integral to endoplasmic reticulum membrane; TAS . 
GO; GO: 0004871; F: signal transducer activity; NAS . 
GO; GO: 0030182; P:neuron differentiation; TAS. 
GO; GO: 0007165; P: signal transduction; NAS. 
DR InterPro; IPR003388; Reticulon. 



DR 


Pfam; PF02453; Reticulon; 


1. 




DR 


PROSITE; 


PS50845; 


RETICULON; 


1. 


KW 


Endoplasmic reticulum; Alternative splicing; Transmembrane; 


KW 


Phosphorylation . 








FT 


TRANSMEM 


603 


623 




POTENTIAL. 


FT 


TRANSMEM 


726 


746 




POTKNTT AT, 


FT 


DOMAIN 


589 






Kill ItU hUN . 


FT 


DOMAIN 


609 


612 




POLY-LEU . 


FT 


VARSPLIC 


1 


420 




Missing (in isoform RTN1-B) . 


FT 










/FTId=VSP_005644. 


FT 


VARSPLIC 


1 


568 




Missing (in isoform RTN1-C) . 


FT 










/FTId=VSP 005645. 


FT 
FT 


VARSPLIC 


569 


588 




GPGPLGPGAPPPLLFLNKQK -> MQATADSTKMDCVWSNW 










KSQ (in isoform RTN1-C) . 


FT 










/FTId-VSP 005646. 


SQ 


SEQUENCE 


776 AA; 


83617 


MW; 


CA5B6232353096FE CRC64; 



Query Match 13.3%; Score 787.5; DB1; Length 776; 

Best Local Similarity 31.9%; Pred. No. 2e-23; 

Matches 229; Conservative 91; Mismatches 190; Indels 207; Gaps 25 

Qy 588 ESEATPSPVLPDI— VMEAPLNSA VPSAGASVIQPSSSPLE— ASSVNY 632 

1 ' : • II: I : I : : | | : | : | 

Db 141 EELGTPGPSLPDVPGIESRGLFSSDSGIEMTPAESTEVNKILADPLDQMKAEAYKYIDIT 200 

Qy 633 — ESIKHEPENPPPYEEA MSVSLKVSGIKEEIKEPENINAAL QET 675 

I : I I : : : I I : : | : | | : : | | | : : . , 

Db 201 RPEEVKHQEQHHPELEDKDLDFKNKDTDI S IKPEGVREPDK- PAPVEGKI I KDHLLEEST 259 

Qy 676 EAPYI S IACDL 1 KETKL S AEPAPDFS DYS EMAKVEQPVPDHS ELVED 722 

MM M : I I ||: : :: .. 

Db 260 FAPYID DLSEEQRRAPQITTPVKITLTEIEPSVE TTTQEKTPEKQDICLK 309 

QY 723 SSPDSEPVDLFS DDS IPDVPQKQ DETVMLVKESLTETSF 761 

I I I : I I III I : | | | : : | | : 
Db 310 PSPDTVPTVTVSEPEDDSPGSITPPSSGTEPSAAESQGKGSISEDELITAIKEA 363 

Qy 762 ESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELS 821 

: 1 1 1 : : I III II::: 

Db 364 - KGLS YETAENPRPVGQLADRP EVKARSGPPTIPSPLDHEA 403 

Qy 822 TAVYSNDDLFISKEAQIRETETFSDSSP1EIIDEFP TLISSKTDSFS 868 

:: 1 I I I I : : I I : I I I 

Db 404 SSAESGD-- SEIELVSEDPMAAEDALPSGYVSFGHVGGPPPSP 444 

Q y 869 KLAREYTDLEVSHKSEIANAPDGAGSLPCTELPHDLSLKNIQPKVEEKISFSD 921 

: M : I : : I : : | : I I I I I I 

Db 445 ASPSIQYSILREEREAELDSELIIESCDASSAS EES PKREQDS P PMKP SALD 496 

Qy 922 DF SKNGSATSKVLL LPPDVSALGHTQAEIESIVKP 956 

MM I II | || | | • : | 

Db 497 AIREETGVRAEERAPSRRGLAEPGSFLDYPSTEPQPGPELPPGDGAL EPETPMLP 551 

Qy 957 KVLEKEAEKKLPSDTEKEDRSPSA 1 FSADLGKT S WDLLYWRDI K 1001 

: I : : M I : I : | || : I || | M M I 
Db 552 RKPEEDSSSNQSPAATKGPGPLGPGAPPPLLF LNKQKAIDLLYWRDIK 599 



Qy 


1002 


KTGWFGASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRA 1061 
: 1 1 : 1 M : 1 II III 1 1 : 1 II 1 1 : 1 1 1 II MINIM 1 : 1 1 : M : II 1 1 II : 1 
QTGIVFGSFLLLLFSLTQFSWSWAYLALAALSATISFRIYKSVLQAVQKTDEGHPFKA 659 


Db 


600 


Qy 


1062 


Y L E S E VAI SEE L VQ K Y S N SAL GH VN CT I KE L RRL FL VD D LVD S L K FAVLMWVFT YVGAL F 1121 

lit I* * 1 * 1 ■III** * 1 1 1*111111111 1 1 1 1 1 I 1 1 1 1 1 I 1 . Illilll 

ill I • - 1 ■ 1 -ill*- • 1 1 1 • I 1 1 M M 1 1 M M M II 1 M 1 M 1 M II II 
YLELEITLSQEQIQKYTDCLQFYVNSTLKELRRLFLVQDLVDSLKFAVLMWLLTYVGMF 719 


Db 


660 


Qy 


1122 


NGLTLLI LALI SLFSVPVI YERHQAQI DH YLGLANKNVKDAMAKIQAKI PGLKRKAE 1178 
1 M 1 1 1 : : 1 : : 1 : 1 : M 1 : 1 : II II M 1 1 1 1 : : : M 1 1 II 1 M II M 
NGLTLLLMAWSMFTLPVVYVKHQAQIDQYLGLVRTHINAWAKIQAKIPGAKRHAE 776 


Db 


720 



RESULT 6 
RTN3_MOUSE 

ID RTN3_MOUSE STANDARD; PRT; 237 AA. 

AC Q9ES97; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 3. 

GN RTN3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Qiang H., Yuan J. , Qiang B.; 

RT "Cloning and expression profile of a novel mouse cDNA encoding a human 

RT RTN3 homolog."; 

RL Submitted (OCT-1999) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Eye; 

RX MEDLINE=22388257; PubMed=1247 7 932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Browns tein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J . , Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 



CC reticulum (Potential) . 

CC -!- SIMILARITY: Contains 1 reticulon domain. 



CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF195940; AAG31360.1; 

DR EMBL; BC014697; AAH14697.1; 

DR MGD; MGI: 1339970; Rtn3 . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 69 8 9 POTENTIAL. 

FT TRANSMEM 167 187 POTENTIAL. 

FT DOMAIN 49 237 RETICULON. 

SQ SEQUENCE 237 AA; 25428 MW; EB60A0A7AC4 5F0DE CRC64; 

Query Match 10.6%; Score 626; DB 1; Length 237; 

Best Local Similarity 53.2%; Pred. No. 6.9e-18; 

Matches 125; Conservative 41; Mismatches 53; Indels 16; Gaps 4; 

EKEAEKKLPS-DTEKEDRSPSAIFSAD LGKTS WDLLYWRDIKKTG 1004 

| | : II : III: Ml I M-III'MM 



Ml :| :||ll 1I::M :|: f I I I M I I I I I : I I II I I : I II : I I I I I : I I I 
FVFGTTLIMLLSLAAFSVI SVVSYLILALLSVTISFRVYKSVIQAVQKSEEGHPFKAYL] 

sevaiseelvqkysnsalghwctikelrrlflvddlwslkfavlmwvftyvgalfng: 

: : : I I I I : I : I I I : I : I I I I I : I I I I I I I I I I I : I I I I I = I M 

vtotlsseafhnymnaamvhwkalkliirlflvedlv^ 

tllilalislfsvpviyerhqaqidhylglanknvkdamakiqakipgl-krkae 117 

I I I I I I : : I I I I : : I I : : : I I I I I : I : I I : I I I I I : I I : I : I I I 



Qy 


960 


Db 


3 


Qy 


1005 


Db 


63 


Qy 


1065 


Db 


123 


Qy 


1125 


Db 


183 



RESULT 7 
RTN3_HUMAN 

ID RTN3_HUMAN STANDARD; PRT; 236 AA. 

AC 095197; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Reticulon protein 3 (Neuroendocrine-specif ic protein-like 2) (NSP- 

DE protein II) (NSPLII) . 

GN RTN3 OR NSPL2. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 



OX NCBI_TaxID=960 6; 

RN [1] 

RP SEQUENCE FROM N.A. , AND TISSUE SPECIFICITY. 

RC TISSUE=Retina; 

RX MEDLINE=9926597 4; PubMed=10331947 ; 

RA Moreira E.F., Jaworski C.J., Rodriguez I.R.; 

RT "Cloning of a novel member of the reticulon gene family (RTN3) : gene 

RT structure and chromosomal localization to llql3." ; 

RL Genomics 58:73-81(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Huang X., Zhou Y., Du G. , Yuan J., Qiang B.; 

RT "Cloning and expression analysis of a cDNA encoding a novel 

RT neuroendocrine-specif ic protein-like protein 1: NSPL1."; 

RL Submitted (JAN-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain, Eye, and Lymph; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 

CC reticulum (Potential) . 

CC -!- TISSUE SPECIFICITY: WIDELY EXPRESSED WITH HIGHEST EXPRESSION IN 
CC BRAIN. THREE TIMES MORE ABUNDANT IN MACULA THAN IN PERIPHERAL 

CC RETINA. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF059524; AAC99319.1; -. 

DR EMBL; AF059529; AAD20951.1; -. 

DR EMBL; AF059525; AAD20951.1; JOINED. 



DR EMBL; AF059526; AAD20951.1; JOINED. 

DR EMBL; AF059527; AAD20951.1; JOINED. 

DR EMBL; AF059528; AAD20951.1; JOINED. 

DR EMBL; AF119297; AAD26810.1; -. 

DR EMBL; BC000634; AAH00634.1; 

DR EMBL; BC010556; AAH10556.1; 

DR EMBL; BC011394; AAH11394.1; -. 

DR EMBL; BC022993; AAH22993.1; 

DR Genew; HGNC: 104 69; RTN3 . 

DR MIM; 604249; -. 

DR GO; GO:0005615; C : extracellular space; TAS . 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Transmembrane; Endoplasmic reticulum. 

FT TRANSMEM 68 88 POTENTIAL. 

FT TRANSMEM 177 197 POTENTIAL. 

FT DOMAIN 48 236 RETICULON. 

SQ SEQUENCE 236 AA; 25609 MW; DDC6A4 544ABCDFB7 CRC64; 



Query Match 10.6%; Score 625.5; DB 1; Length 236; 

Best Local Similarity 54.8%; Pred. No. 7.1e-18; 

Matches 119; Conservative 41; Mismatches 56; Indels 1; Gaps 



1; 



Qy 

Db 



963 AEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASLFLLLSLTVFSI 1022 

II I I : : : I I I : : ! I I : I I I I I I I : I : I I I I II : 

20 AEPSAPGGGGSPGACPALGTKSCSSSCAVHDLIFWRDVKKTGFVFGTTLIMLLSLAAFSV 79 



Qy 

Db 

Qy 

Db 

Qy 

Db 



1023 VSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISEELVQKYSNSAL 1082 

: II : I : I I I I I I I I I I I I I I I I I I : I I I : I I i I I : II I : '\ I I I : I : 

80 ISWSYLILALLSVTISFRIYKSVI QAVQKSEEGHPFKAYLDVDITLSSEAFHNYMNAAM 139 

1083 GHVNCT I KELRRLFLVDDLVD S LKFAVLMWVFT YVGALFNGLTLL I LAL I S LFSVPVI YE 1142 

.1:1 : I : I I I I I : I I I I I I I II II : I I M I : I I I : I I I I I I : : I I I I : : I I 
140 VHINRALKLIIRLFLVEDLVDSLKLAVFMWLMTYVGAVFNGITLLILAELLIFSVPIVYE 199 

1143 RHQAQI DH YLGLANKNVKDAMAKI QAKI PGL- KRKAE 1178 
: : : I I I I I : I : I I : I II I I : I I : I : I I I 

200 KYKTQIDHYVGIARDQTKSIVEKIQAKLPGIAKKKAE 236 



RESULT 8 
RTN2_HUMAN 

ID RTN2_HUMAN STANDARD; PRT; 545 AA. 

AC 075298; 060509; 
DT 16-OCT-2001 (Rel. 
DT 16-OCT-2001 (Rel. 
DT 10-OCT-2003 (Rel. 

DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 
DE protein 1) (NSPLI) . 
GN RTN2 OR NSPLI. 
OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE FROM N.A., ALTERNATIVE SPLICING, AND TISSUE SPECIFICITY. 



40, Created) 

40, Last sequence update) 
42, Last annotation update) 

2 (Neuroendocrine-specif ic protein-like 1) 



RC TISSUE=Lung carcinoma; 

RX MEDLINE=98360096; PubMed=9693037 ; 

RA Roebroek A.J.M., Contreras B., Pauli I.G.L., Van de Ven W.J.M.; 

RT "cDNA cloning, genomic organization, and expression of the human RTN2 

RT gene, a member of a gene family encoding reticulons . " ; 

RL Genomics 51:98-106(1998). 

RN [2] 

RP SEQUENCE OF 108-545 FROM N.A. (ISOFORM RTN2-B) . 

RC TISSUE=Brain; 

RX MEDLINE=98191726; PubMed=9530622 ; 

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Endoplasmic 
CC reticulum (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=RTN2-A; 

CC Isold=0752 98-1; Sequence=Displayed; 

CC Note=Isoform RTN2-C is produced by alternative initiation at 

CC Met-341 of isoform RTN2-A; 

CC Name=RTN2-B; 

CC IsoId=075298-2; Sequence=VSP_00564 9 ; 

CC Event=Alternative initiation; 

CC Comment=2 isoforms, RTN2-A (shown here) and RTN2-C, are produced 

CC by alternative initiation at Met-1 and Met-341; 

CC -!- TISSUE SPECIFICITY: ISOFORM RTN2-C IS HIGHLY EXPRESSED IN SKELETAL 
CC MUSCLE. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF004222; AAC32542.1; -. 

DR EMBL; AF004223; AAC32543.1; -. 

DR EMBL; AF004224; AAC32544.1; 

DR EMBL; AF038540; AAC14910.1; -. 

DR Genew; HGNC: 10468; RTN2 . 

DR MIM; 603183; -. 

DR GO; GO:0030176; C:integral to endoplasmic reticulum membrane; NAS . 

DR GO; GO: 0004871; F: signal transducer activity; NAS . 

DR GO; GO: 0007165; P:signal transduction; NAS. 

DR InterPro; IPR003388; Reticulon. 

DR Pfam; PF02453; Reticulon; 1. 

DR PROSITE; PS50845; RETICULON; 1. 

KW Endoplasmic reticulum; Alternative splicing; Transmembrane; 

KW Alternative initiation. 

FT CHAIN 1 545 RETICULON PROTEIN 2, ISOFORM RTN2-A. 

FT CHAIN 341 545 RETICULON PROTEIN 2, ISOFORM RTN2-C. 

FT INIT_MET 341 341 FOR ISOFORM RTN2-C. 

FT TRANSMEM 368 38 8 POTENTIAL. 



FT TRANSMEM 4 63 4 83 POTENTIAL . 

FT DOMAIN 345 545 RETICULON. 

FT VARSPLIC 272 344 Missing (in isoform RTN2-B) 

FT / FTId=VSP_0 05649. 

SQ SEQUENCE 545 AA; 59263 MW; 97 1FD2F909E1E9E6 CRC64; 



Query Match 8.7%; Score 514; DB 1; Length 545; 

Best Local Similarity 28.2%; Pred. No. 3.7e-13; 

Matches 169; Conservative 81; Mismatches 186; Indels 164; Gaps 18; 

Qy 676 EAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPD-SEPVDL-F 733 

III III I II : I : : : I I : I I I : I I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEEDEE — ETTSQDWGTPRELTF 64 

Qy 734 S DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

I II I : I : I : : : I : I I I 

Db 65 SYIAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPSLGDSLESIPSLSQSPEPGRR 124 



Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

II I I I I | : | ::: | :::: | 

Db 125 GDP DTAPPSE RPLEDLRLRLDHLG WVARGTGSGE 158 



Qy 841 TETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGAGSLPCTEL 900 

: I I : I : I I I I : I I I I II 
Db 159 DSSTSSSTPLE — DEEP QEPNRLETGEAGE EL 188 



Qy 901 PHDLSLKNIQPKVEEKI SFSDDFSKNGSATSKVLLLPPDVSALGHT 94 6 

111:11 I : : I I : : : : : I I : I 

Db 189 — DLRLRLAQPSSPEVLTPQLSPGSGTPQAGTPSPSRSRDSNSGPEEPLLEEEEKQWGPL 246 

Qy 947 QAE 1 ES I VKPKVLEKEAE KKLP 968 

: I : | | : | : : | | | : | 

Db 247 EREPVRGQCLDSTDQLEFTVEPRLLGTAMEWLKTSLLLAVYKTVPILELSPPLWTAIGWV 306 



Qy 969 SDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWFGASL 1011 

: : : II I I : I : I I I I I I : I : : I I I I : 

Db 307 QRGPTPPTPVXRVLLKWAKSPRSSGVPSLSLGTyDMG-SKVADLLYWKDTRTSGWFTGLM 365 



Qy 1012 FLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEVAISE 1071 

III I I I II I I : : I I I I I I I I : I : I : I I : : I : I I : I I I : : : : : 
Db 366 VSLLCLLHFSIVSVAAHLALLLLCGTISLRVYRKVLQAVHRGDGANPFQAYLDVDLTLTR 425 



Qy 1072 ELVQKYSNSALGHVNCT I KELRRLFLVDDLVDS LKFAVLMWVFTYVGALFNGLTLLI LAL 1131 

I : : I : I : I I I I I : I I I I I I I I : I : : I : I I I : I II I I I I I I : 

Db 426 EQTERLSHQITSRWSAATQLRHFFLVEDLVDSLKLALLFYILTFVGAIFNGLTLLILGV 485 



Qy 1132 ISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGL KRKAE 1178 

I I I : : I : : I : I I I I I I I : I I : I I I : I I I II I I I I 

Db 486 IGLFTIPLLYRQHQAQIDQYVGLVTNQLSHIKAKIRAKIPGTGALASAAAAVSGSKAKAE 545 



RESULT 9 
RTN2_MOUSE 

ID RTN2_MOUSE STANDARD; PRT; 471 AA. 

AC 070622; 070620; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 



DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Reticulon protein 2 (Neuroendocrine-specif ic protein-like 1) (NSP-like 

DE protein 1) (NSPLI) . 

GN RTN2 OR NSPLI. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORMS 1 AND 2), AND TISSUE SPECIFICITY. 

RC STRAIN=FVB/N, and 129/Sv; TISSUE=Cerebellum, and Skeletal muscle; 

RX MEDLINE=98191726; PubMed=9530622 ; 

RA Geisler J.G., Stubbs L.J., Wasserman W.W., Mucenski M.L.; 

RT "Molecular cloning of a novel mouse gene with predominant muscle and 

RT neural expression."; 

RL Mamm. Genome 9:274-282(1998). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC TISSUE=Retina; 

RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C, Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X . , Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C, Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C, 

RA Rodriguez A.C, Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- SUBCELLULAR LOCATION: Membrane-bound. Endoplasmic reticulum 

CC (Potential) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; Synonyms=Brain; 

CC IsoId=O70622-l; Sequence=Displayed; 

CC Name =2 ; Synonyms=Muscle; 

CC IsoId=O70622-2; Sequencers P_0 05 650, VSP_005651; 

CC -!- TISSUE SPECIFICITY: Expressed predominantly in neural and muscular 
CC tissues. 

CC -!- SIMILARITY: Contains 1 reticulon domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



cc 


entities requires a license agreement (See http://www.isb-sib.ch/announce/ 


cc 


or send an email to license@isb-sib . ch) . 


rc 












EMBL; AF038537; AAC14906.1 




DR 


EMBL ; AF038537; AAC14907.1 




DR 


EMBL; AF038538; AAC14908.1 




u r\ 


EMBL; AF038539; AAC14909.1 




DR 


EMBL; AF093624; AAD13195.1, 






EMBL; BC031370; AAH31370.1, 




DR 


MGD; MGI: 107612; Rtn2 . 




Ur\ 


InterPro; IPR003388; Reticulon. 


r\D 


Pfam; PF02453; Reticulon; 1. 


f)D 
U l\ 


PROSITE; PS50845; RETICULON; 1. 


r\ w 


Endoplasmic reticulum; Alternative splicing; Transmembrane. 


FT 
IT J. 


TRANSMEM 295 315 


POTENTIAL. 




DOMAIN 272 471 


RETICULON. 


FT 


VARSPLIC 1 2 67 


Missing (in isoform 2) . 


FT 




/FTId=VSP_005650. 


FT 


VARSPLIC 268 271 


PLLL -> MGSK (in isoform 2) . 


FT 




/FTId-VSP 005651. 


SQ 


SEQUENCE 471 AA; 51346 MW; 9BBD8F372CF63AD3 CRC64; 



Query Match 8.4%; Score 500; DB 1; Length 471; 

Best Local Similarity 29.3%; Pred. No. l.le-12; 

Matches 159; Conservative 83; Mismatches 177; Indels 124; Gaps 16; 

Qy 67 6 EAPYISIACDLIKETKLSAEPAPDFSDYSEMAKVEQPVPDHSELVEDSSPD-SEPVDL-F 733 

III III I I I : I : : I I I : I I I : I I 

Db 13 EAP STASSTPDSTEGGNDDSDFRELHTAREFSEDEEE — ETTSQDWGTPRELTF 64 

Qy 734 S DDSIPDVPQKQDETVMLVKESLTETSFESMIEYENKEKLSALPPEG 780 

I | | : | : | : | : : : : : | | | 

Db 65 SYIAFDGWGSGGRRDSWRRPRPQGRSVSEPRDPPQQSGLGDSLESIPSLSQSPEPGRR 124 

Qy 781 GKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRE 840 

II I : I I |:: |::::| :: : | | 

Db 125 GDP DPVPPAE RPLEELRLRLDQLG WWRSAGSGE 158 

Qy 841 TETFSDSSPIEIIDEFPTLISS KTDSFSKLARE-YTDLEV SHKSEIAN 887 

I I : I : I : I I : : : I : : I I : : III I : I : 

Db 159 DSATSSSTPLE — NEEPDGLEASEAGEETNLELRLAQSLHLQLEVLTPQLS PS SGTPQAH 216 

Qy 888 APDGAGSLPCTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ 947 

I I Mil: III : : I I : : 

Db 217 TPSPQRSQDSNSGPDDEPLLNV VEEH WRLLEQEPITAQCLDST 259 

Qy 948 AEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLYWRDIKKTGWF 1007 

: I : : : I : I I I I I I I : I : : I II 

Db 260 DQSEFMLEPLLL VADLLYWKDTRTSGAVF 288 

Qy 1008 GASLFLLLSLTVFSIVSVTAYIALALLSVTISFRIYKGVIQAIQKSDEGHPFRAYLESEV 1067 

: III I I I I I I I : : I I I I I I I : I : I : I I : : I : I I : I I I : : : 
Db 289 TGLMASLLCLLHFSIVSVAAHLALLGLCATISLRAA^RKVLQAVHRGDGTNPFQAYLDMDL 348 

Qy 1068 AISEELVQKYSNSALGHVNCTIKELRRLFLVDDLVDSLKFAVLMWVFTYVGALFNGLTLL 1127 

"I : : I II I : I I I II : I I I I I I I I : I : : I : i I I : II I I I I : 

Db 349 TLTREQTERLSQQIASHWSTATQLRHFFLVEDLVDSLKLALLFYILTFVGAI FNGLTLV 408 



Qy 1128 ILALISLFSVPVIYERHQAQIDHYLGLANKNVKDAMAKIQAKIPGL KR 1175 

I I : : : I I : I I : : I : I I I I I I I : I I : I I I : I I I I I I 

Db 409 I LGWALFTVPLLYRQHQAQI DQYVGLVTNQLSHI KAKI RAKI PGTGTLAPTASVSGS KA 468 

Qy 1176 KAE 1178 

I I I 

Db 469 KAE 471 



RESULT 10 
CPN__DROME 

ID CPN_DROME STANDARD; PRT; 865 AA. 

AC Q02910; 

DT 01-OCT-1993 (Rel. 27, Created) 

DT 01-OCT-1993 (Rel. 27, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Calphotin. 

GN CPN OR CAP. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165729; PubMed=8 094559; 

RA Martin J.H., Benzer S., Rudnicka M. , Miller C.A.; 

RT "Calphotin: a Drosophila photoreceptor cell calcium-binding protein."; 

RL Proc. Natl, Acad. Sci. U.S.A. 90:1531-1535(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Canton-S; 

RX MEDLINE=93165730; PubMed=84340 15 ; 

RA Ballinger D.G., Xue N., Harshman K.D.; 

RT "A Drosophila photoreceptor cell-specific protein, calphotin, binds 

RT calcium and contains a leucine zipper."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:1536-1540(1993). 

CC -!- FUNCTION: Might function as a calcium-sequestering "sponge" to 
CC regulate the amount of free cytoplasmic calcium. It binds 0.3 mole 

CC of Ca(2+) per mole of protein. 

CC -!- SUBUNIT: Homodimer (Probable). 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic; hypodense compartment. 

CC -!- TISSUE SPECIFICITY: Soma and axons of photoreceptor cells of 
CC compound eyes and ocelli. 

CC -!- DEVELOPMENTAL STAGE: Expressed early in photoreceptor cell 
CC development. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC : 



DR EMBL; L02111; AAA28405.1; -. 

DR EMBL; L05080; AAA28420.1; 

DR PIR; A47282; A47282. 

DR PIR; A47283; A47283. 

DR FlyBase; FBgn0010218; Cpn . 

DR GO; GO: 0005509; F: calcium ion binding; IDA. 

KW Calcium-binding. 

FT CONFLICT 36 3 6 A -> AVAPAWA (IN REF. 2) . 

FT CONFLICT 43 43 I -> T (IN REF. 2). 

FT CONFLICT 64 64 I -> V (IN REF. 2). 

FT CONFLICT 76 76 T -> A (IN REF. 2) . 

FT CONFLICT 100 100 P -> PP (IN REF. 2) . 

FT CONFLICT 126 127 VQ -> AP (IN REF. 2). 

FT CONFLICT 154 154 I -> V (IN REF. 2) . 

FT CONFLICT 160 160 S -> T (IN REF. 2) . 

FT CONFLICT 534 534 A -> E (IN REF. 2) . 

FT CONFLICT 699 699 I -> T (IN REF. 2) . 

FT CONFLICT 703 703 V -> L (IN REF. 2) . 

FT CONFLICT 721 721 D -> E (IN REF. 2) . 

SQ SEQUENCE 865 AA; 84781 MW; 2110417E0B0E7CFE CRC64; 

Query Match 5.5%; Score 324.5; DB 1; Length 865; 

Best Local Similarity 22.3%; Pred. No. l.le-05; 

Matches 229; Conservative 128; Mismatches 375; Indels 297; Gaps 49; 

Qy 59 PAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPLPAAPPVAPERQPSWDPSPVSSTVP 118 

I : I I I I I I II: I Ml I : I I I I : I I : I 

Db 9 PVSAPVAAPV-TPSAVAAPVQWSPAAVAPAPAAPIAVTPVAPPPTLASVQPATV — TIP 65 

Qy 119 APSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQAEPVWTPPAP APAAPP — ST 172 

I I : I : : I I : I : I II I I I I : I I : I I I I I I I I : I 

Db 66 APAPIAAASVTP VASVAPPWAAPTPPAA-SPVSTPVAVAQIPVAVSAPVAPPVAAT 121 

Qy 173 PAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLETAASXPSLSPLSAASF 232 

I : : I : I I I I : I I I : I I : I : 

Db 122 PTPWQIPVAAPVIAT PPVAASA PT PAAVT PVI S PVI AS — 160 

Qy 233 KEHEYLGNLSTVLPTEGTLQENVSEASKEVSEKAKTLLIDRDLTEFSELEYSEMGSSFSV 2 92 

I : I I | | : : | : : | : | 

Db 161 PPWPANTT VPVAAPVAAVPAAVPWAPVLAP AV 194 

Qy 293 SPKAESAVIVAN PREEIIVKNKDEEEKLVSNNILHXQQELPTALTKLVKEDEWS 347 

: I I : I 1 I I I I : | : | : | I I : 

Db 195 APAV — APWAET PAP P PVAE I P VAT 1 PECVAPLI PEVS WA 234 

Qy 348 SEKAKDSFNEKRVAV^IAPMREEYADFKPFERVWEVKDSKEDSDMLAAGGKIESNLESKVD 4 07 

: I : I I I I I I : : : | : : : I 

Db 235 T K P LAAAE PVWAP PAT ET PWAPAAAS PHVSVAPAVETAWAPVS 280 

Qy 408 KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNPAATESIATNIFP 467 

II : : I : I I I II 
Db 2 81 ASTEPPV AAATLTTAPETPAL 301 

Qy 4 68 LLEDPTSENXTDEKKIEEKKAQI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEE 527 

I : I I III II 
Db 302 APWAESQ VAA NTWATPP 320 



Qy 528 WANMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA 579 

I I I : I : I : I II : I I I : I I I I : I I 

Db 321 TPAPEPETIAPPW : AET P E VAS VAVA- ET T P P W- - P P VAAE SI- PAP WATT P V 371 

Qy 580 -AQLCPSFEESEATPSPVLPDIVMEAPLNSAVPSAGASVIQPSSSPLEASSVNYESIKHE 638 

I I : : I : I I I : : : I : I I I : I I : I I : 

Db 372 PATLAVTDPDVTASAVPELPPVIAPSPVPSAVAETPVDLAPPVLPPVAAEPVPAWAEET 431 

Qy 639 PENPPPYEEAMSV-SLKVSGIKEEIKEPENINAALQETEAPYISIACDLIKETKLSAEPA 697 

MM : : : : | : : I I : I I I : I : : I I I 

Db 432 PETPAPASAPVTIAALDIPEVAPVIAAPSDAPAEAPSAAAPIVSTPPTTASVPETTAPPA 491 

Qy 698 P DFSDYSEMAKVEQPVPDHSEL VEDSSPDSEPVDLFSDDSIP — DVPQK 744 

I I I I I : I I I I : I I : I I I : I : I 

Db 492 AVPTEPIDVSVLSE-AAIETPVAPPVEVTTEVAVADVAPPEAAADLIIEPVEPPAPIPDL 550 

Qy 745 QDETVMLVKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEV 804 

: : I : I : : : I : : I I I : : : : I 

Db 551 LEQTTSVPAVEAAESTSSPIPE TSLPPPNEAVASPEVAVAPITAPEPIPEPEP 603 

Qy 805 STLSKKEKIPLQ MEELSTAVYSNDDLFISKEAQIRETETFSDSSPIEIIDEFPTLI 860 

I : I I I : : : : I II I : I I I III III: 
Db 604 S LAT P T E P I P VEAP WI Q EAVDAV EVPVTETST SIP-ETTVEFPEAV 649 

Qy 861 SSKTDSFSKLAREYTDLEV-SHKSEIANAPDGAGSLPCTELP HDLSL 906 

: I I | : | : : : : | | | | | I I I : I : : : 

Db 650 AEKV LDPAITEAPVTTQEPDVANINDGA PAT E I T T PAVE I VT AAAE VS D I AI 701 

Qy 907 KNIQPKVEEKISFSDDFSKNGSATSKVLL LP PDVSALG 944 

I I I : : I : : : : : : : I : : : I I : I I 

Db 702 PVIDPPVPQEIAVA-EIPETDTKPAEVIVEQSTIPIEAPVPEVSKYAEPVISEAPAAEVP 760 

Qy 945 HTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGKTSWDLLY 996 

: I I : I I : I I I : I : :: III I : I : I I 
Db 761 ITAGDNPDNTSVGISEW-PTIAEKPVEEVPTSEIPEQSSSPSD — SVPVAK--ITPLL- 814 

Qy 997 WRDIKKTGV 1005 

II:: I I 
Db 815 -RDLQTTDV 822 



RESULT 11 
PCLO_HUMAN 

ID PCLO_HUMAN STANDARD; PRT; 5147 AA. 

AC Q9Y6V0; 043373; 060305; Q9BVC8; Q9UIV2; Q9Y6U9; 
DT 28-FEB-2003 (Rel. 41, Created) 
DT 28-FEB-2003 (Rel. 41, Last sequence update) 
DT 10-OCT-2003 (Rel. 42, Last annotation update) 
DE Piccolo protein (Aczonin) (Fragments). 
GN PCLO OR ACZ OR KIAA0559. 
OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI_TaxID=9606; 
RN [1] 

RP SEQUENCE OF 1-759 FROM N.A. 
RC TISSUE=Brain; 



RX MEDLINE=99439764; PubMed=10508862 ; 

RA Wang X., Kibschull M. , Laue M.M. , Lichte B., Petrasch-Parwez E., 

RA Kilimann M.W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP SEQUENCE OF 552-4404 FROM N.A. 

RA Kraemer J., Wollam C, Wohldmann P., McGrane B. ; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 3619-5147 FROM N.A. (ISOFORM 2) . 

RC TISSUE=Brain; 

RX MEDLINE=98290545; PubMed=9628581; 

RA Nagase T., Ishikawa K.-I., Miyajima N., Tanaka A., Kotani H. , 

RA Nomura N., Ohara O.; 

RT "Prediction of the coding sequences of unidentified human genes. IX. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro. 11 ; 

RL DNA Res. 5:31-39(1998). 

RN [4] 

RP SEQUENCE OF 4405-4439 FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM. , Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A. , Rubin G.M., Hong L. , 

RA Stapleton M. , Scares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J. , 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C. f Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E. f Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [5] 

RP SEQUENCE OF 4405-5147 FROM N.A. 

RA Kalicki J., Elliott G.; 

RL Submitted (FEB-1998) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 
CC organization of synaptic active zones and in synaptic vesicle 

CC trafficking (By similarity) . 

CC SUBUNIT: Interacts with Rabacl/Pral, RIMS2 and profilin (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 
CC synaptic junctions (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 



CC Event=Alternative splicing; Named isof orms=2 ; 

CC Coinment=Additional is o forms seem to exist; 

CC Name=l; 

CC IsoId=Q9Y6V0-l ; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q9Y6V0-2; Sequence=VSP_003923 , VSP_003924, VSP_003925, 

CC VSP_003926, VSP_003927; 

CC Note=No experimental confirmation available; 

CC -'-DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; Y19188; CAB60727.1; 

DR EMBL; AC004903; AAD20936.1; -. 

DR EMBL; AC004886; AAD21789.1; 

DR EMBL; AB011131; BAA25485.1; 

DR EMBL; BC001304; AAH01304.1; -. 

DR EMBL; AC004082; AAB97937.1; -. 

DR PIR; T00634; T00634. 

DR HSSP; P04410; 1A25. 

DR Genew; HGNC: 13406; PCLO. 

DR MIM; 604918; -. 

DR GO; GO: 0005856; C : cytoskeleton ; NAS . 

DR GO; GO: 0045202; C: synaptic junction; ISS. 

DR GO; GO: 0005509; F: calcium ion binding; ISS. 

DR GO; GO: 0005544; F : calcium-dependent phospholipid binding; ISS. 

DR GO; GO: 0005522; F:profilin binding; ISS. 

DR GO; GO:0007010; P : cytoskeleton organization and biogenesis; ISS. 

DR GO; GO:0016080; P:synaptic vesicle targeting; ISS. 

DR InterPro; IPR000008; C2 . 

DR InterPro; IPR001565; Synaptotagmin . 

DR PRINTS; PR00360; C2 DOMAIN . 

DR PRINTS; PR00399; SYNAPTOTAGMN . 

DR SMART; SM00239; C2 ; 2. 

DR PROSITE; PS00499; C2_DOMAIN_l; 1. 

DR PROSITE; PS50004; C2_DOMAIN_2; 2. 

KW Calcium/phospholipid-binding; Zinc; Metal-binding; Zinc-finger; 

KW Repeat; Alternative splicing. 

FT NON_TER 1 1 

FT DOMAIN 400 465 10 X 10 AA TANDEM APPROXIMATE REPEATS OF 

FT P-A-K-P-Q-P-Q-Q-P-X. 

FT ZN_FING 4 99 523 C4-TYPE (POTENTIAL) . 

FT ZN_FING 969 992 C4-TYPE (POTENTIAL) . 

FT NON_CONS 1010 1011 

FT DOMAIN 2300 2325 POLY-PRO. 

FT DOMAIN 4391 4442 PDZ . 

FT DOMAIN 4544 4633 C2 DOMAIN 1. 



5031 5121 C2 DOMAIN 2. 

4404 4404 S -> SGNGLGIRIVGGKEIPGHSGEIGAYIAKILPGGSAE 

QTGKLMEG (in isoform 2) . 

/FTId=VSP_003923. 
4534 4534 K -> KPTDGTKWSHPITGEIQ (in isoform 2) - 

/FTId=VSP_003924 . 
4576 4576 G -> GQVMWQNAS (in isoform 2). 

/FTId=VSP_003925. 
4757 4761 TAHKS -> SKRRK (in isoform 2) . 

/FTId=VSP_003926. 
4762 5147 Missing (in isoform 2) . 

/FTId=VSP_003927. 
5147 AA; 563537 MW; CD5D84990498CD3C CRC64; 



FT DOMAIN 
FT VARSPLIC 
FT 
FT 

FT VARSPLIC 
FT 

FT VARSPLIC 
FT 

FT VARSPLIC 
FT 

FT VARSPLIC 

FT 

SQ SEQUENCE 

Query Match 5.4%; Score 320; DB 1; Length 5147; 

Best Local Similarity 21.8%; Pred. No. 0.00015; 

Matches 260; Conservative 152; Mismatches 440; Indels 340; Gaps 61 

Qy 8 PLVSSSDSPPRPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAP 67 

I : I I I I I : : | | : : I I I I 

Db 260 PSLPSPSKPPIQQPTPGKPPAQQPGHEKSQPG PAKPPAQPSGLT 303 

Qy 68 VPTAPAAGA P LMD FGND FVP PAP RG P L PAAP P V 100 

II I I : I I I I I I I ! 

Db 304 KPLAQQPGTVKPPVQPPGTTKPPAQPLG — PAKPPAQQTGSEKPSSEQPGPKALAQPPGV 361 

Qy 101 — APERQPSWDPSPVSSTVPAPSPLS AAAVSPSKLPEDDEPPA RPP 144 

I : II I : I I I I : I I I : I I : I I : : I I 

Db 362 GKTPAQQPG-PAKPPTQQVGTPKPLAQQPGLQSPAKAPGPTKTPAQTKPPSQQPGSTKPP 420 

Qy 145 P — PPPASVSPQAEPVWTPPAPAP-AAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGN 201 

I I I I I I I I I : I : I I I : I I 

Db 421 PQQPGPAKPSPQQPGSTKPPSQQPGSAKPSA QQPSP 456 

Qy 202 TISAGQEDFPSVLLETAASXPSLSPLSAASFKEHEYLGNLSTVLPTEGTLQENVSEASKE 261 

: I : I : : I I I : I I : I I : I I : I 

Db 457 AKPSAQQ-FTKPVSQTGFGKPLQPPTVSPSAKQPPSQGLPKTICPL CNTTELLLH 510 

Qy 2 62 VSEKA KTLLI DRDLTEFSE LEYSEMGSSFSVSPKA 2 96 

I I I I : I : : I I I I : | : | : 

Db 511 VPEKANFNTCTECQTTVCSLCGFNPNPHLTEAKEWLCLNCQMKRALGGDLAPVPSSPQPK 570 

Qy 297 ESAVIVANPREEIIVKNKDEEEKLVSNNILHXQQEL PTALTKLVKED 343 

I I I : : I : : II I hi I I I I : 

Db 571 LKTAPVTTTSAVSKSSPQPQQTSPKKDAAPK QDLSKAPEPKKPPPLVKQP 620 

Qy 344 EWSSEKAK DS FN E K RVAVEAPMRE E YADFKPFERVWEVKDSKEDSDM 391 

: I I I II:: I : I I I I I : | | : I : 

Db 621 TLHGSPSAKAKQPPEADSLSKPAPPKEPSVPSEQDKAPVADDKP KQPKMVKPTTDL 67 6 

Qy 392 LAAGGKIESNLESKVD KKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYI 448 

: : : I : : I I | : : | : | : : | : Mill: 
Db 677 VSS SSATTKPDIPSSKVQSQAEEKTTPPLKTDSAKPSQSFPPTGEKV 723 

Qy 449 TCAPFN PAATESIATNIFPLLE DPTSENXTDEKKIEEKKAQIVTEKN 4 95 

II: II: | : : | | I : I I I I I I I 

Db 724 --TPFDSKAIPRPASDSKIISHPGPSSESKGQKQVDPV QKKEEPKKAQTKMSPK 775 



Qy 496 TSTK TSNPFFVAAQDSETDYVTTD NLTKVTEEWANMPEGLT 537 

I I . I I I : t I : I : I : I 
Db 776 PDAKPMPKGSPTPPGPRPTAGQTVPTPQQSPKPQEQSRRFSLNLGSITD APKSQPTT 832 

Qy 538 PDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQESLYPA — AQLCPSFEESEATPSP 595 

I I i III : : I I : : : I : I i : I I : I 

Db 833 P QET VTGKLFGFGASI-FSQASNLI STAGQPGPHSQSGPGAPMKQA- PAP 880 

Qy 596 VLPDIVMEAPLNSA-VPSAGASVI QPSS S PLEAS SVNYES IK — HEPENPPPY 645 

I I : : I I I I | : : | | : : : | : | | | 

Db 881 SQPPTSQGPPKSTGQAPPAPAKSIPVKKETKAPAAEKLEPKAEQAPTVKRTETEKKPPPI 940 

Qy 646 EEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIK-ETKLSAEPAPDFSDYS 704 

: : : I : : I I : : : : I I I I I : : : I : I : : 

Db 941 KDSKSLT AEPQKAVLPTKLEKSPKPESTCPLCKTELNIGSKDPPNFNTCT 990 

Qy 705 EMAKVE- QPVPDHSELVE DSSPDSEPVDLFSDDS 737 

I | | : | : | | : | : : :: | 

Db 991 ECKNQVCNLCGFNPTPHLTENCQTQRAISGQLGDIRKMPPAPSGPKASPMPVPTESSSQK 1050 

Qy 738 — IPDVPQ KQD ETVML — VKESLTETSFESMIEYENKEKLSALPPEGG 781 

: I II II: I I : I I I I : I : I : : I : : I I : 

Db 1051 TAVP--PQWLVKKQEQEVKTEAEKVILEKVKETLSMEKIPPMVTTDQKQEESKLEKDKA 1108 

Qy 782 KPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIRET 841 

I I I I I : I : I : : I I I I I : I : I : I I : 

Db 1109 SALQE — KKPLPEEK-KLIPEE-EKIRSEEKKPLLEEKKPTP EDKKLLPEAKTSAP 1160 

Qy 842 ETFSD SSPIEIIDE-FPTLISSKTDSFSKLAREYTDLEVSHKSEIANAPDGA-GSLP 896 

I I ::| :| :: I I I :::: I I III 

Db 1161 EEQKHDLLKSQVQIAEEKLEGRVAPKTVQEGK QPQTKMEGLPSGTPQSLP 1210 

Qy 897 CTELPHDLSLKNI QPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQ — 947 

: I : I I II I : | : : : : | : | : | | : : 

Db 1211 KED DKTTKTIKEQPQPPCTAKPDQEKEDDKSDTSSSQQPKSPQGLSDTGYSSDGISS 1267 

Qy 948 — AEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSADLGK — TSWDLL 995 

III:: II: II I : : : I I I : : I I I : : I : : I 

Db 1268 SLGEIPSLIPTD — EKDILKGLKKDSFSQESSPSS — PSDLAKLESTVLSIL 1315 



RESULT 12 
PCLO_MOUSE 

ID PCLO_MOUSE STANDARD; PRT; 5038 AA. 

AC Q9QYX7; Q9QYX6; Q9QZJ0; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Piccolo protein (Presynaptic cytomatrix protein) (Aczonin) (Brain- 

DE derived HLMN protein) . 

GN PCLO OR ACZ. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 
RN [1] 



RP SEQUENCE FROM N.A., SUBCELLULAR LOCATION, ALTERNATIVE SPLICING, 

RP TISSUE SPECIFICITY, AND INTERACTION WITH PROFILIN. 

RC TISSUE=Brain; 

RX MEDLINE=99439764; PubMed=10508862 ; 

RA Wang X., Kibschull M., Laue M.M., Lichte B . , Petrasch-Parwez E., 

RA Kilimann M.W. ; 

RT "Aczonin, a 550-kd putative scaffolding protein of presynaptic active 

RT zones, shares homology regions with rim and bassoon and binds 

RT profilin."; 

RL J. Cell Biol. 147:151-162(1999). 

RN [2] 

RP REVISIONS. 

RA Kilimann M.W. ; 

RL Submitted (AUG-2001) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE OF 4502-4682 FROM N.A. 

RC TISSUE=Brain; 

RA Huang W., Jin W. , Huang C, Chen B. , Zhang J., Ju G. ; 

RL Submitted (OCT-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP INTERACTION WITH RIMS2 . 

RX MEDLINE=22384373; PubMed=124 01793 ; 

RA Fujimoto K., Shibasaki T., Yokoi N. , Kashima Y. , Matsumoto M. , 

RA Sasaki T., Tajima N., Iwanaga T., Seino S.; 

RT "Piccolo, a Ca2+ sensor in pancreatic beta-cells. Involvement of 

RT cAMP-GEFI I . Rim2 . Piccolo complex in cAMP-dependent exocytosis . " ; 

RL J. Biol. Chem. 277:504 97-50502(2002). 

CC -!- FUNCTION: May act as a scaffolding protein involved in the 
CC organization of synaptic active zones and in synaptic vesicle 

CC trafficking. 

CC -!- SUBUNIT: Interacts with Rabacl/Pral, RIMS2 and profilin. 

CC -!- SUBCELLULAR LOCATION: Concentrated at the presynaptic side of 

CC synaptic junctions. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q9QYX7-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q9QYX7-2; Sequence=VSP_00392 8, VSP_003929; 

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Low levels found in 
CC stomach. Not detected in other tissues analyzed including adrenal 

CC gland, testis and pancreas. 

CC -!- DOMAIN: C2 domain 1 is involved in binding calcium and 

CC phospholipids. Calcium binds with low affinity but with high 

CC specificity and induces a large conformational change. 

CC -!- SIMILARITY: Contains 2 C2 domains. 

CC -!- SIMILARITY: Contains 1 PDZ/DHR domain. 

CC : 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; Y19185; CAB60731.2; 



DR EMBL; Y19186; CAB60732.2; 

DR EMBL; AF181269; AAD55786.2; 

DR HSSP; P04410; 1A25. 

DR MGD; MGI: 1349390; Pclo. 

DR GO; GO: 0045202; C: synaptic junction; IDA. 

DR GO; GO: 0005509; F: calcium ion binding; ISS. 

DR GO; GO: 0005544; F: calcium-dependent phospholipid binding; ISS. 

DR GO; GO: 0005522; F:profilin binding; IDA. 

DR GO; GO: 0019933; P : cAMP-mediated signaling; IDA. 

DR GO; GO: 0007010; P : cytoskeleton organization and biogenesis; IDA. 

DR GO; GO: 0030073; P: insulin secretion; IDA. 

DR GO; GO:0017157; P:regulation of exocytosis; IDA. 

DR GO; GO: 0016080; P: synaptic vesicle targeting; NAS . 

DR InterPro; IPR000008; C2 . 

DR InterPro; IPR001478; PDZ . 

DR InterPro; IPR008899; Znf_piccolo. 

DR Pfam; PF00168; C2 ; 2. 

DR Pfam; PF00595; PDZ; 1. 

DR Pfam; PF05715; Zf_piccolo; 2. 

DR SMART; SM00239; C2 ; 2. 

DR SMART; SM0022 8; PDZ; 1. 

DR PROSITE; PS004 99; C2_DOMAIN_l; 1. 

DR PROSITE; PS50004; C2_DOMAIN_2; 2. 

DR PROSITE; PS50106; PDZ; 1. 

KW Calcium/phospholipid-binding; Metal-binding; Zinc; Zinc-finger; 

KW Repeat; Alternative splicing. 



FT 


DOMAIN 


371 


470 


10 X 10 AA TANDEM APPROXIMATE REPEATS OF 


FT 








P-A-K-P-Q-P-Q-Q-P-X. 


FT 


ZN_FING 


502 


526 


C4-TYPE (POTENTIAL) . 


FT 


ZN_FING 


967 


990 


C4-TYPE (POTENTIAL) . 


FT 


DOMAIN 


2305 


2329 


POLY-PRO. 


FT 


DOMAIN 


4394 


4488 


PDZ. 


FT 


DOMAIN 


4607 


4705 


C2 DOMAIN 1. 


FT 


DOMAIN 


4922 


5012 


C2 DOMAIN 2. 


FT 


VARSPLIC 


4829 


4833 


TKPTN -> SKRRK (in isoform 2) . 


FT 








/FTId=VSP_003928. 


FT 


VARSPLIC 


4834 


5038 


Missing (in isoform 2) . 


FT 








/FTId=VSP_003929. 


SQ 


SEQUENCE 


5038 


AA; 547600 


MW; DADA4 60CF3B40888 CRC64; 



Query Match 5.3%; Score 315; DB 1; Length 5038; 

Best Local Similarity 22.0%; Pred. No. 0.00023; 

Matches 254; Conservative 136; Mismatches 435; Indels 328; 



Gaps 57; 



Qy 

Db 

Qy 

Db 

Qy 

Db 



57 RKPA — AGL S AAP VPT AP AAGAP LMD FG NDFVPPAPR GPLPAAPPVAPE 103 

: I I I I : I II I II: | : | | : I 

288 KSPAQPAGTGKSPAQPPVTAKPPAQQAGLEKTSLQQPGPKSLAQTPGQGKVPPGPAKSPA 347 

104 RQ PSWDPSP-VSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ-A 155 

: I I : I I : I I I I : I III I : I I I I I 

34 8 QQPGTAKLPAQQPGPQTASKVPGPTKT PAQLSGPGKTPAQQPGPTKPSPQQPIPAKPQPQ 4 07 

156 EPVWT PPAPAPAAPP STPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQED 209 

: I I I I I I I I 1 I I I I : : : I III I : 

408 QPVATKPQPQQPAPAKPQPQHPTPAKPQPQQPTPA KPQPQQPTPAKPQP 4 56 



Qy 



210 FPSVLLETAASXPSLS 



PLSA — ASFKEHEYLGNLSTVLP TEGTLQENVS 256 



I : : I I I I III I : I I : I I I 

Db 457 QHPGLGKPSAQQPSKSISQTVTGRPLQAPPTSAAQAPAQGLSKTICPLCNTTELLLHTPE 516 

Qy 257 EASKEVSEKAKTLLI DRDLTEFSEL EYSEMGSSFSVSPKA 296 

: I : : : : : : III I I : : I I : I I I 

Db 517 KANFNTCTECQSTVCSLCGFNPNPHLTEIKEWLCLNCQMQRALGGELAAIPSSPQPTPKA 576 

Qy 297 ESAVIV ANPREEIIVKN KDEEEK LVSNNILHX QQE 331 

I I : I : : I : I III II II I 

Db 577 ASVQPATASKSPVPSQQASPKKELPSKQDSPKAPESKKPPPLVKQPTLHGPTPATAPQPP 636 

Qy 332 LPTALTKLV — KEDEWSSEKAKDSFNEKRVAVEAPMREEYADFKPFERWEVKDSKEDS 389 

: I I I I : I : I I I I : I I : I : : : : I I 

Db 637 VAEALPKPAPPKKPSAALPEQAK APV ADVEPKQP — KTTETLTDS 679 

Qy 390 DMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYIT 449 

II : I I : I I : I : : 1 : II I I : I 
Db 68 0 PSSAAATSKPAILSSQVQ AQAQVTTAPPLKTDSAKTSQSFPPTGD T 725 

Qy 450 CAP FN PAATESIATNIFPLLEDPTSENXTD-EKKIEEKKAQIVTEKNTSTKTSN 502 

I : II: I : : I I I I : : I I I I I II II 
Db 726 ITPLDSKAMPRPASDSKIVSH PGPTSESKDPVQKKEEPKKAQTKVTPKPDTK 777 

Qy 503 PFFVAAQDSETDYVTTDNLTKVTEEV VANMPEGLTPDLVQEACE 546 

I : |||::: : I : I : I II 

Db 77 8 PVPKGSPTPSGTRPTTGQATPQSQQPPKPPEQSRRFSLNLGGIADAPKS-QPTTPQET — 834 

Qy 547 SELNEVTGTKIAYETKMDLVQTSEVMQESLYPAAQLCPSFEESEATPSPVLP DIVM 602 

III : : ||:: III: III I : 

Db 835 VTGKLFGFGASI-FSQASNLIS TAGQQAPHPQTGPAAPSKQAPPPSQTLAA 884 

Qy 603 EAPLNSA — VPSAGASVIQ PSSSPLEASSVNYESI KHEP ENPP 643 

: I I I I I I I : : I I I : : III II 

Db 885 QGPPKSTGQHPSAPAKTTAVKKETKGPAAENLEAKPAQAPTVKKAEKDKKHPPGKVSKPP 94 4 

Qy 644 PYEEAMSVSLKVSGIKEEIKEPENINAALQETEAPYISIACDLIK-ETKLSAEPAPDFSD 702 

I III I : : | | | : | : : : | : | : 

Db 945 P TEPEKAVLAQKPDKTTKPKPACPLCRTELNVGSQDPPNFNT 986 

Qy 703 YSEMAKVE QPVPDHSELVE DSSPDSEPVD 731 

: I | | : | : | 11 = 11 

Db 987 CTECKNQVCNLCGFNPTPHLTEIQEWLCLNCQTQRAISGQLGDMDKMPPASSGPKASPVP 104 6 

Qy 732 LFSD DSIPDVPQKQDETV MLVKESLT 757 

: : : : : | : | : : : I : : I 

Db 1047 APAEPPPQKTPTAAHAKGKKKETEVKAETEKQIPEKETPSIEKTPPAVATDQKLEESEVT 1106 

Qy 758 ETSFESMIEYENKEKLSAL PPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSK 809 

: : : | : | : | | II I I I I hi III : I : 

Db 1107 KSLVSVLPEKKPSEEEKALPADKKEKKPPAAEAPPLEEKKPIPDDQK--LPPDAKPSASE 1164 

Qy 810 KEKIPLQMEELSTAVYSNDDLFISKEAQIR-ETETFSDSSPIEIIDEFPTLISSKTDSFS 868 

I: : : I I :: I I I : I I h I I : 
Db 1165 GEE KRDLLKAHVQI PEEGPIGKVASLACEGEQQPDTRPEDLPGATPQTLPKD 1216 

Qy 8 69 KLAREYTDLEVSHKSEIANAPDGAG — SLPCTELPHDLS--LKNIQPKVEEKISFSDDFS 924 

I : : I : I :| |. I II II : | | | : : I : I 



Db 



1217 - 



— RQKESRDVTQPQAEGTAKEGRGEPSKDRTEKEEDKSDTSSSQQPKSPQGLS-DTGYS 1272 



Qy 925 KNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSDTEKEDRSPSAIFSA 984 

: I : I II III:: lh III::: III: : 

Db 1273 SDGISGS LG EIPSLIPSD--EKDLLKGLKKDSFSQESSPSS — PS 1313 

Qy 985 DLGK — TSWDLL 995 

III : : I : : I 
Db 1314 DLAKLESTVLSIL 1326 



RESULT 13 
ANK2_HUMAN 

ID ANK2_HUMAN STANDARD; PRT; 3924 AA. 

AC Q01484; Q01485; 

DT 01-APR-1993 (Rel. 25, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Ankyrin 2 (Brain ankyrin) (Ankyrin B) (Ankyrin, nonerythroid) . 

GN ANK2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxI D= 9 6 0 6 ; 

RN [1] 

RP SEQUENCE FROM N . A. (ISOFORMS 1; 2 AND 3) . 

RC TISSUE=Brain stem; 

RX MEDLINE=913024 66; PubMed-1830053 ; 

RA Otto E., Kunimoto M. , McLaughlin T., Bennett V.; 

RT "Isolation and characterization of cDNAs encoding human brain 

RT ankyrins reveal a family of alternatively spliced genes."; 

RL J. Cell Biol. 114:241-253(1991). 

RN [2] 

RP REVISIONS. 

RA Carpenter S.; 

RL Submitted (MAY-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC TISSUE=Brain stem; 

RX MEDLINE=940754 09; PubMed=8253844 ; 

RA Chan W., Kordeli E., Bennett V. ; 

RT "440-kD ankyrinB: structure of the major developmentally regulated 

RT domain and selective localization in unmyelinated axons."; 

RL J. Cell Biol. 123:1463-1473(1993). 

RN [4] 

RP SEQUENCE OF 4 63-4 95 FROM N.A. 

RX MEDLINE=92009921; PubMed=1833308 ; 

RA Tse W.T., Menninger J.C., Yang-Feng T.L., Francke U., Sahr K.E., 

RA Lux S.E., Ward D.C., Forget B.G.; 

RT "Isolation and chromosomal localization of a novel nonerythroid 

RT ankyrin gene." ; 

RL Genomics 10:858-866(1991). 

CC -!- FUNCTION: Attach integral membrane proteins to cytoskeletal 
CC elements. Also bind to cytoskeletal proteins. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=3; 

CC Name=l; 



CC IsoId=Q01484-l; Sequence=Displayed; 

CC Name =2 ; 

CC IsoId=Q01484-2; Sequence=VSP_000267 , VSP_000268; 

CC Name=3; 

CC IsoId=Q01484-3; Sequence=VSP_000268 ; 

CC -!- TISSUE' SPECIFICITY: Plasma membrane of neurons as well as glial 
CC cells throughout the brain. 

CC -!- PTM: Phosphorylated at multiple sites by different protein kinases 

CC and each phosphorylation event regulates the protein ! s structure 

CC and function (Potential) . 

CC -!- SIMILARITY: Contains 23 ANK repeats. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X56957; CAA40278.1; 

DR EMBL; X56958; CAA40279 . 2 ; 

DR EMBL; Z26634; CAB42644.1; 

DR EMBL; M37123; AAA62828.1; 

DR PIR; S37431; S37431. 

DR HSSP; P42771; 1DC2 . 

DR Genew; HGNC:4 93; ANK2 . 

DR MIM; 106410; 

DR InterPro; IPR002110; ANK. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00023; ank; 24. 

DR Pfam; PF00531; death; 1. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01415; ANKYRIN. 

DR SMART; SM00248; ANK; 22. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50088; AN K_RE P EAT ; 20. 

DR PROSITE; PS50297; ANK_REP_REGION; 1. 

DR PROSITE; PS50017; D EAT H_DOMAI N ; 1. 

KW Cytoskeleton; Alternative splicing; Repeat; ANK repeat; 

KW Phosphorylation. 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



REPEAT 


63 


92 


ANK 


1. 


REPEAT 


96 


125 


ANK 


2. 


REPEAT 


129 


158 


ANK 


3. 


REPEAT 


162 


191 


ANK 


4. 


REPEAT 


193 


220 


ANK 


5. 


REPEAT 


232 


261 


ANK 


6. 


REPEAT 


265 


294 


ANK 


7. 


REPEAT 


298 


327 


ANK 


8. 


REPEAT 


331 


360 


ANK 


9. 


REPEAT 


364 


393 


ANK 


10 


REPEAT 


397 


426 


ANK 


11 


REPEAT 


430 


459 


ANK 


12 


REPEAT 


463 


492 


ANK 


13 



FT 


REPEAT 


496 


525 


ANK 14 




FT 


REPEAT 


529 


558 


ANK 15 




FT 


REPEAT 


562 


591 


ANK 16 


■ 


FT 


REPEAT 


595 


624 


ANK 17 




FT 


REPEAT 


628 


657 


ANK 18 




FT 


REPEAT 


661 


690 


ANK 19 




FT 


REPEAT 


694 


723 


ANK 20 




FT 


REPEAT 


727 


756 


ANK 21 




FT 


REPEAT 


760 


7 89 


ANK 22 




FT 


REPEAT 


793 


822 


ANK 23 




FT 


DOMAIN 


1773 


1950 


RE PEAT - 


-RICH REGION. 


FT 


REPEAT 


1773 


1784 


REPEAT 


A. 


FT 


REPEAT 


17 85 


17 96 


REPEAT 


A. 


FT 


REPEAT 


1797 


1808 


REPEAT 


A. 


FT 


REPEAT 


1809 


1820 


REPEAT 


A. 


FT 


REPEAT 


1821 


1832 


REPEAT 


A. 


FT 


REPEAT 


1833 


1844 


REPEAT 


A. 


FT 


REPEAT 


1845 


1856 


REPEAT 


A. 


FT 


REPEAT 


1857 


1867 


REPEAT 


A (APPROXIMATE) . 


FT 


REPEAT 


1868 


1879 


REPEAT 


A. 


FT 


REPEAT 


1880 


1891 


REPEAT 


A. 


FT 


REPEAT 


1892 


1902 


REPEAT 


A (APPROXIMATE) . 


FT 


REPEAT 


1903 


1914 


REPEAT 


A. 


FT 


REPEAT 


1915 


1926 


REPEAT 


A. 


FT 


REPEAT 


1927 


1938 


REPEAT 


A. 


FT 


REPEAT 


1939 


1950 


REPEAT 


A. 


FT 


DOMAIN 


3536 


3620 


DEATH. 




FT 


VARSPLIC 


1039 


1039 


Q -> QFLGKLHLPTAPPPLNEGESLVSRILQLGPPGTK 


FT 








(in isof orm 2 ) . 


FT 








/FTId=VSP_000267. 


FT 


VARSPLIC 


1444 


3528 


Missing (in isoform 2 and isoform 3) . 


FT 








/FTId=VSP_0002 68. 


FT 


CONFLICT 


475 


476 


GQ -> PE (IN REF. 4) . 


FT 


CONFLICT 


.971 


971 


I -> S 


(IN REF. 1) . 


FT 


CONFLICT 


3581 


3582 


QY -> HA (IN REF. 1) . 


FT 


CONFLICT 


3586 


3586 


I -> Y 


(IN REF. 1) . 


SQ 


SEQUENCE 


3924 


AA; 430337 


MW; 52AC496C428E29D2 CRC64; 



Query Match 5.0%; Score 299; DB 1; Length 3924; 

Best Local Similarity 21.5%; Pred. No. 0.00069; 

Matches 258; Conservative 167; Mismatches 417; Indels 358; Gaps 62; 

Qy 14 DSPP RPQPAFKYQFVREPEDEEEEEEEEEEDEDEDLE ELEVLERK 5 8 

III : I i I : ::::::: I I : II II 

Db 1648 DIPPDETQSTQKQHKPSLGIKKPVRRKLKEKQKQKEEGLQASAEKAELKKGSSEESLGED 17 07 

Qy 59 P AAG L S AAP VP T AP AAGAP LMD FGNDFVPPAPRG 92 

I I I : I : I I I : I I : : I : I : I 

Db 1708 P--GIAPEPLPTWAT-SPLIEETPIGSIKDKVKALQKRVEDEQKGRSKLPIRVKGKEDV 1764 



Qy 93 PLPAA-PPVAPERQPSWDPSP VSSTVPAPSPL 123 

11111:11 III II II: 

Db 1765 PKKTTHRPHPAASPSLKSERHAPGSPSPKTERHSTLSSSAKTERHPPVSPSSKTEKHSPV 1824 



Qy 124 SAAA VSPSKLPEDDEPPA RPPPPPPASVSPQAEPVW 159 

I : I I I I I I I : I I I I : : : I I 

Db 1825 SPSAKTERHSPASSSSKTEKHSPVSPSTKTERHSPVSSTKTERHPPVSPSGKTDKRPPV- 1883 



Qy 160 T P PAP APAAP P S T P AAP KRR GSSGAWXXXXKIMDLKEQPGNTISAGQEDFPSVLLE 216 

: I 11:1 : : I II I I I : I : : I . 

Db 1884 SPSGRTEKHPPVSPGRTEKRLPVSPSGRTD KHQPVSTAGKTEKHLPVSPSG 1934 

Qy 217 TAASXPSLSPLSAAS-FKEHEYLGNLSTVLPT EGTLQENVSEASKEVSEKAKT 2 68 

I : I I I : I : I : : I I : I hill 

Db 1935 KTEKQPPVSPTSKTERIEETMSVRELMKAFQSGQDPSKHKTGLFEHKSAKQKQPQEKGKV 1994 

Qy 269 LLIDRDLTEFSELEYSEMGSSFSVSPKAES— AVIVANPREEIIVKNKDE — 316 

: I I: : : I : : I h III I h : I I I : I 

Db 1995 RVEKEKGPILTQRE-AQKTENQTIKRGQRLPVTGTAESKRGVRVSS 1 GVKKEDAAG 2049 

Qy 317 -EEKLVSNNI LHXQQELPTALTKLVKE DEW 34 6 

: I I: : I: I I : h I hi I I : 

Db 2050 GKEKVLSHKI PEPVQSVPEEESHRESEVPK — EKMADEQGDMDLQISPDRKTSTDFSEVI 2107 

Qy 347 S S E - KAKDS FNEKRVAVEAPMREE YAD FKPFERVWEVKDSKED 388 

I : I : : I : : I : : I II : : h : 

Db 2108 KQELEDNDKYQQFRLSEETEKAQLHLDQVLTSPFNTTFPLDYMKDEFLPALSLQSGALDG 2167 

Qy 389 -SDMLAAGGKIESNLESKVDKKCFADSLEQTNHE KDSESSNDDTSFPSTPEGIKDRS 444 

hi I I I : : II II : | : | : | | : | : : : : : 

Db 2168 SSESLKNEGVAGSPCGSLMEGTPQI SS EES YKHEGLAET PETS PESLSF — SPKKSEEQT 2225 

Qy 445 GAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPF 504 

I : II I I h : : I I I : I : : I I I : : 

Db 2226 G ETKESTKTETTTEIRSEKEHPTTKDITGGS — EERGATVTEDSETSTESFQKE 2277 

Qy 505 FVAAQDSETDYVTTDNLTKVTEEWA-NMPEGLTPDLVQEACESELNEVTGTKIAYETKM 563 

: I hi : I I I I I : I I I h h : I 

Db 2278 ATLGSPKDTSPKRQDDCTGSCSVALAKETPTGLT EEAACDEGQRTFGSS-AHKT — 2330 

Qy 564 DLVQT S EVMQES LYPAAQLCP S FEES EAT P S PVL PDIVMEAP — LNSAV 610 

II III : : | : : | | | I I : : I I h 

Db 2331 QTDSEAQES TATSDETKALPLPEASVKTDTGTESKPQGVIRSPQGLELAL 2380 

Qy 611 PSAGASVIQ PS SS PLEAS SVNYESIKHE- PEN — PPPYEEA-MSVSLKVSG 657 

II : h II I I I : : h h : I I : h I h I 

Db 2381 PSRDSEVLSAVADDSLAVSHKDSLEAS PVLEDNSSHKTPDSLEPSPLKESPCRDSLESSP 2440 

Qy 658 IKEEIKE PEN — INAALQETE — APYISIACDLIKETKLSAEPAPDFSDYSEMAKVE 710 

: : : : I I : : I h : I I I : I : : : III : I : I 

Db 2441 VEPKMKAGIFPSHFPLPAAVAKTELLTEVASVRSRLLRD PDGS — AEDDSLE 24 90 

Qy 711 QPVPDHSELVEDS SPDSEPVDLFSDDSI PDVPQKQDETVMLVKESLTETSFESMI 765 

I : h I I Mh : I : I : : I I 
Db 2491 Q TSLMESSGKSPLSPDTPSSEEVSYEVTPKTTDVSTPKPAVIHECAEED 2539 

Qy 766 EYENKEKLSALPPEGGKPYLESFKLSLDNTKDTLLPDEVSTLSKK EKIPLQMEELS 821 

: I I I I II III: I I : I h : h : I I I I I 

Db 2540 DSENGEKKRFTPEE EMFKMV TKIKMF-DELEQEAKQKRDYKKEPKQEESSS 2589 

Qy 822 TAVYSNDDLFISKEA-QIRETETFSDSSPIEIIDEFPTLISSKTDSFSKLAREYTDLEVS 880 

: hi I : : : I : I I : I |: : I : : I : 
Db 2590 S SDPDADCSVDVDEPKHTGSGEDESGV PVLVTSESRKVSSSSES 2633 



Qy 881 HKSEIANAPDGAGS LPCTELPHDLSLKN IQPKVEEKI SF--SDD 922 

: I : I III | : | | : : II I : : : I : : I 

Db 2 634 -EPELAQLKKGADSGLLPEPVIRVQPPSPLPSSMDSNSSPEEVQFQPWSKQYTFKMNED 2 692 

Qy 923 FSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKKLPSD TEKEDRS 977 

: : : I I :: I I I I I I I II 

Db 2693 TQEEPGKSEE EKDSESHLAEDRHAVSTEAEDRS 2725 



RESULT 14 
MAPB HUMAN 



ID MAP B_HUMAN STANDARD; PRT; , 2468 AA. 

AC P46821; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32 f Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) [Contains: MAPI light chain 

DE LCI] . 

GN MAP1B. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal brain; 

RX MEDLINE=9 5 104 835; PubMed=7 806212; 

RA Lien L.L., Feener C, Fischbach N., Kunkel L.M.; 

RT "Cloning of human microtubule-associated protein IB and the 

RT identification of a related gene on chromosome 15."; 

RL Genomics 22:273-280(1994). 

CC -!- FUNCTION: The function of brain MAPS is essentially unknown. 

CC Phosphorylated MAP1B may play a role in the cytoskeletal changes 

CC that accompany neurite extension. Possibly MAP1B Binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LCI, LC2 and LC3, can associate 

CC with MAP 1A and MAP1B proteins. 

CC -!- DOMAIN: Has a highly basic region with many copies of the sequence 

CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LCI is coexpressed with MAP1B. It is a polypeptide generated 

CC from MAP1B by proteolytic processing. It is free to associate with 

CC both MAP1A and MAP IB. It interacts with the amino-terminal region 

CC of MAP1B (By similarity) . 

CC -!- SIMILARITY: TO MAP1A. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; L06237; AAA18904.1; 



DR Genew; HGNC:6836; MAP IB . 

DR MIM; 157129; 

DR GO; GO: 0005875; C : microtubule associated complex; TAS . 

DR InterPro; IPR000102; MAPlB_neuraxin . 

DR Pfam; PF00414; MAPlB_neuraxin; 10. 

DR PROSITE; PS00230; MAP1B_NEURAXIN; 6. 

KW Microtubule; Repeat; Phosphorylation. 

FT CHAIN ? 2 468 MAPI LIGHT CHAIN LCI. 

FT REPEAT 1878 1894 MAP1B 1. 

FT REPEAT 1895 1911 MAPlB 2. 

FT REPEAT 1912 1928 MAP IB 3. 

FT REPEAT 1929 1945 MAPlB 4. 

FT REPEAT 1946 1962 MAPlB 5. 

FT REPEAT 1963 1979 MAPlB 6. 

FT REPEAT 1997 2013 MAPlB 7. 

FT REPEAT 2014 2030 MAPlB 8. 

FT REPEAT 2031 2047 MAPlB 9. 

FT REPEAT 2048 2064 MAPlB 10. 

FT DOMAIN 5 89 790 LYS-RICH (HIGHLY BASIC, CONTAINS MANY 

FT KKEE AND KKEI/V REPEATS) . 

SQ SEQUENCE 2468 AA; 270618 MW; 540839CBDF09D4 61 CRC64; 

Query Match 5.0%; Score 296.5; DB 1; Length 2468; 

Best Local Similarity 21.5%; Pred. No. 0.00049; 

Matches 264; Conservative 148; Mismatches 459; Indels 355; Gaps 59; 

Qy 2 EDLDQSPLVSSS-DSPPRPQPAFKYQ FVREP EDEE 35 

M :: 11:1 II : : I : : = M I i : 

Db 956 EDGEEHVCVSASKHS PTEDEESAKAEADAYI REKRESVASGDDRAEEDMDEAI EKGEAEQ 1015 

Qy 36 EEEEEEEEDEDEDLEELEV-LERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPAPRGPL 94 

■ | | I : I I I : I I I I I : I I I I I I M I I 

Db 1016 SEEEADEEDKAEDAREEEYEPEmEAEDYVMAVVDKAAEAGGAEEQYGFLTTPTKQLG-- 1073 

Q y 95 PAAPPVAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPPPASVSPQ 154 

:| |:|: I hi I I I hi I Is 

Db 1074 AQSPGREPA — SSIHDETLPGGSESEATA SDEENREDQPEEFTATSGYT 1120 

Qv 155 AEPVWTPPAPAPAAPPSTPAAPKRRGSSGAWXXXXKIMDLKEQPGNTISAGQE D 2 09 

:| : I I II 

Db 112 l QSTIEISSEPTPMDEMSTP RDVMSDETNNEETESPSQEFVNITK 1164 

Qy 210 FPSVLLETAASXPS-LSPLSAASFKE HEYLGNLSTVLPTEGTLQENVSEAS 259 

: | I | | : : : | | : II : | : | | : I : : I : : 

Db 1165 YESSLYSQEYSKPADVTPLN— GFSEGSKTDATDGKDYNASASTISPPSSMEEDKFSRSA 1222 

Qv 260 KEVSEKAKTLLIDRDLTEF SELEYSEMGS SFS 291 

,1,11: I II : :l =11 

Db 1223 LRDAYCSEV — KASTTLDIKDSISAVSSEKVSPSKSPSLSPSPPSPLEKTPLGERSVNFS 12 80 

Qy 292 VSP KAESAVIVANPR--EEIIVKN--KDEEEKL VSNNILHX 328 

: : | I I : I : I : I : : : : I : : I I : : I 

Db 1281 LTPNEIKVSAEAEVAPVSPEVTQEWEEHCASPEDKTLEWSPSQSVTGSAGHTPYYQSP 1340 

Qy 329 QQELPTALTKLVKEDEWSSEKAKDSFNEKRVAVEAPMREEYADFK-PFERVWEVK 383 

|||: : I I I : I : : I I =111 I : I I : I 
Db 1341 TDEKSSHLPTEV — IEKPPAVPVSFEFSDAKDENERASVSPMDEPVPDSESPIEKVLSPL 1398 



Qy 384 DSKEDSDMLAAGGKIESNLESKVDKKCFADSLEQTNHEKDSESSNDDTSFPSTPEGIKDR 443 

J : III: II I I I : : I I : 
Db 1399 RS PPLIGSESAYESFLSA — DDKASGRGAESPFEEKSGKQGSPDQVS PVSE 1447 

Qy 444 SGAYITCAPFNPAATESIATNIFPLLEDPTSENXTDEKK IEEKK-- A 488 

:| I :|: I: II III:: ::hl 

Db 1448 MTSTSLYQDKQEGKSTDFAPIKEDFGQEKKTDDVEAMSSQPALALDERKLGDVSPT 1503 

Qy 489 QI VTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTKVTEEWA — N 531 

t I : : I I I : : I I . : I I : : I : I I : 

Db 1504 QIDVSQFGSFKEDTKMSISEGTVSDKSATP — VD E GVAE D T Y S HME GVAS VS T AS VAT S S 1561 

Qy 532 MPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAAQLCP 584 

II III : : I : I : : : : I I I I | : : I : : I I 

Db 1562 FPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSKEECPRPMSIS 1620 

Qy 585 SFEESEATPSPVLP-DIVMEAPLNSAVPSAGASVI 618 

II : : I I I : : I : I : I I I : 
Db 1621 PPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEQSLAMDFSRQSPDH PTVGAGVL 1677 

Qy 619 Q PSSSPLEASSVNYESIKHE — PENPPPY EEAMSVS-LKVSGIKEEIKE 664 

I : I : I : I : I II I : I I I : : I 

Db 1678 HITENGPTEVDYSPSDMQDSSLSHKIPPMEEPSYTQDNDLSELISVSQVEASPSTSSAHT 1737 

Qy 665 PENINAALQETE AP YISIACDLIKE TKLSAEP A 697 

I I : I I I II ||:::: III : : 

Db 1738 PSQIASPLQEDTLSDVAPPRDMSLYASLTSEKVQSLEGEKLSPKSDISPLTPRESSPLYS 1797 

Qy 698 PDFSDYSEMAKVEQPVPDHSELVEDSSPDSEPVDLFS 734 

INI: II: II III I : I I 
Db 1798 PTFSDSTSAVK-EKTATCHS SSSP PIDAASAEPYGFRASVLFDTMQHHLALNR 1849 

Qy 735 DDSIPDVP : QKQDETVMLVKESLTETSFESMIEYENKEKLSALPPE 77 9 

III: I I : I I I : I : II : I I : 

Db 1850 DLSTPGLEKDSGGKTPGDFSYAYQKPEET — TRSPDEEDYDYESYEKTTRTSDV 1901 

Qy 780 GGKPYLESFKLSLDNTKDTLLPDEVSTLSKKEKIPLQMEELSTAVYSNDDLFISKEAQIR 839 

I I I I : : : I : I I : I I I I I : I I : 

Db 1902 GGY-YYEKIERTTKSPSDSGYSYE— TIGKTTKTP EDGDYSYE — IIEKTTRTP 1950 

Qy 840 ETETFSDSSPIEIIDEFPTLISSKTDS FSKLAREYTDLEVSHKSEIANAPDGAGSLP 896 

I : I I I I I I I : I I : : I : I I : 
Db 1951 EEGGYSYD ISEKTTSPPEVSGYSYEKTERSRRLLDDISNGYDDS 1994 

Qy 897 CTELPHDLSLKNIQPKVEEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAE-IESIVK 955 

: I I : : II I : : II III : I I I - 

Db 1995 -EDGGHTLGDPSYSYETTEKITSFPESEGYSYETSTKTTRTPDTSTYCYETAEKITRTPQ 2053 

Qy 956 PKVLEKE AEKKLPSDTEKE 974 

I II I I II : :: 

Db 2 054 ASTYSYETSDLCYTAEKKSPSEARQD 2079 



RESULT 15 
MAPB_RAT 

ID MAPB_RAT STANDARD; PRT; 2459 AA. 



AC P15205; Q62958; Q9ER21; Q9QW92; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Microtubule-associated protein IB (MAP IB) (Neuraxin) [Contains: MAPI 

DE light chain LCI] . 

GN MAPlB. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE OF 1-142 FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Tes tis ; 

RX MEDLINE=96257242; PubMed=8 6662 95 ; 

RA Liu D., Fischer I.; 

RT "Isolation and sequencing of the 5 1 end of the rat microtubule- 

RT associated protein (MAP1B) -encoding cDNA. " ; 

RL Gene 172:307-308(19 96). 

RN [2] 

RP SEQUENCE OF 96-2459 FROM N.A. , DOMAIN, AND INDUCTION. 

RC STRAIN=Sprague-Dawley; TISSUE=Brain, and Glial tumor; 

RX MEDLINE=92347374; PubMed-1639092 ; 

RA Zauner W. , Kratz J., Staunton J., Feick P., Wiche G. ; 

RT "Identification of two distinct microtubule binding domains on 

RT recombinant rat MAP IB."; 

RL Eur. J. Cell Biol. 57:66-74(1992). 

RN [3] 

RP SEQUENCE OF 1541-2459 FROM N.A. , AND TISSUE SPECIFICITY. 

RC TISSUE=Spinal cord; 

RX MEDLINE=90059871; PubMed=2555150; 

RA Rienitz A., Grenningloh G., Hermans-Borgmeyer I., Kirsch J., 

RA Littauer U.Z., Prior P., Gundelfinger E.D., Schmitt B., Betz H.; 

RT "Neuraxin, a novel putative structural protein of the rat central 

RT nervous system that is immunologically related to microtubule- 

RT associated protein 5."; 

RL EMBO J. 8:2879-2888(1989). 

RN [4] 

RP DEVELOPMENTAL STAGE, AND PHOSPHORYLATION. 

RX MEDLINE=97405699; PubMed=9260743; 

RA Ma D,, Nothias F. , Boyne L.J., Fischer I.; 

RT "Differential regulation of microtubule-associated protein IB (MAPlB) 

RT in rat CNS and PNS during development."; 

RL J. Neurosci. Res. 49:319-332(1997). 

CC -!- FUNCTION: The function of brain MAPS is essentially unknown. 
CC Phosphorylated MAP IB may play a role in the cytoskeletal changes 

CC that accompany neurite extension. Possibly MAPlB Binds to at least 

CC two tubulin subunits in the polymer, and this bridging of subunits 

CC might be involved in nucleating microtubule polymerization and in 

CC stabilizing microtubules. 

CC -!- SUBUNIT: 3 different light chains, LCI, LC2 and LC3, can associate 
CC with MAPI A and MAPlB proteins. 

CC -!- TISSUE SPECIFICITY: Nervous system (spinal cord, brain stem, 

CC cerebellum and cerebrum) . Not expressed in liver, spleen, kidney, 

CC heart or muscle. 

CC -!- DEVELOPMENTAL STAGE: In cerebral cortex, spinal cord and sciatic 
CC nerve levels are high early in development but decrease during 



CC postnatal development and are low in adults . In dorsal root 

CC ganglia levels remain high throughout development. 

CC -!- INDUCTION: By nerve growth factor. 

CC DOMAIN: Has a highly basic region with many copies of the sequence 

CC KKEE and KKEI/V, repeated but not at fixed intervals, which is 

CC responsible for the binding of MAP1B to microtubules. 

CC -!- PTM: LCI is coexpressed with MAP IB . It is a polypeptide generated 
CC from MAP1B by proteolytic processing. It is free to associate with 

CC both MAPI A and MAP IB. It interacts with the amino-terminal region 

CC of MAP IB (By similarity) . 

CC -!- PTM: Phosphorylated. 

CC -!- SIMILARITY: TO MAP1A. 

CC -!- CAUTION: A C-terminal fragment of this protein (residues 1597 to 
CC 2459) was originally described as neuraxin in Ref.3. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U52950; AAB17068.1; -. 

DR EMBL; X60370; CAC16162.1; 

DR EMBL; X16623; CAA34620.1; ALT_SEQ. 

DR PIR; A56577; A56577. 

DR InterPro; IPR000102; MAPlB__neuraxin . 

DR Pfam; PF00414; MAPlB_neuraxin; 10. 

DR PROSITE; PS00230; MAP1B_NEURAXIN; 8. 

KW Microtubule; Repeat; Phosphorylation. 



FT 


CHAIN 


? 


2459 


MAPI 


LIGHT CHAIN LCI. 


FT 


REPEAT 


1869 


1885 


MAP1B 


1. 


FT 


REPEAT 


1886 


1902 


MAP1B 


2. 


FT 


REPEAT 


1903 


1919 


MAP1B 


3. 


FT 


REPEAT 


1920 


1936 


MAP1B 


4. 


FT 


REPEAT 


1937 


1953 


MAP IB 


5. 


FT 


REPEAT 


1954 


1970 


MAP1B 


6. 


FT 


REPEAT 


1988 


2004 


MAP1B 


7. 


FT 


REPEAT 


2005 


2021 


MAP1B 


8. 


FT 


REPEAT 


2022 


2038 


MAP IB 


9. 


FT 


REPEAT 


2039 


2055 


MAP1B 


10. 


FT 


DOMAIN 


559 


1035 


GLU-RICH. 


FT 


DOMAIN 


588 


786 


LYS-RICH (HIGHLY BASIC, CONTAINS MANY 


FT 








KKEE 


AND KKEI/V REPEATS) . 


FT 


DOMAIN 


2224 


2312 


LYS-RICH. 


FT 


CONFLICT 


127 


127 


M -> 


V (IN REF. 1) . 


FT 


CONFLICT 


140 


140 


T -> 


S (IN REF. 1) . 


FT 


CONFLICT 


2112 


2112 


R -> 


K (IN REF. 3) . 


FT 


CONFLICT 


2169 


2169 


L -> 


I (IN REF. 3) . 


SQ 


SEQUENCE 


2459 


AA; 269497 


MW; 2E3F6872DEDB8BA2 CRC64; 


Query Match 




4.9%; 


Score 


289; DB 1; Length 2459; 



Best Local Similarity 20.9%; Pred. No. 0.00094; 

Matches 275; Conservative 158; Mismatches 462; Indels 422; Gaps 61; 
Qy 30 EPEDEEEEEEEEEEDEDEDLEELEVLERKPAAGLSAAPVPTAPAAGAPLMDFGNDFVPPA 89 



II Mill: : : II : : I III II : I I : 
Db 1004 EAEQSEEEGEEEEDKAEDAREEDHEPDKTEAEDYVMAVVDKAAEAGVTEDQY — DFL 1058 

Qy 90 PRGPLPAAPP— VAPERQPSWDPSPVSSTVPAPSPLSAAAVSPSKLPEDDEPPARPPPPP 147 

III : I I : I : I I : I I II hi I 

Db 1059 GTPAKQPGVQSPSREPA — SSIHDETLPGGSESEATA SDEENREDQPEEF 1106 

Qy 148 PAS VS P QAE P VWT P PAP AP AAP P S T P AAP KRRG S S GAWXXXXKI MDLKEQ P GNT I S AGQ 2 07 

I : : I I I I I : I : III 

Db 1107 TATSGYTQSTIEISSEPTPMDEMSTP RDVMTDETNNEETESPSQ 1150 

Qy 208 E DFPSVLLETAASXP SLSPLSAASFKE HEYLGNLSTVLPTEGTLQENV 255 

I : I I II | : M I : : I : I I : I :: 

Db 1151 EFVNITKYESSLYSQEYSKPWAS FNGLSDGSKTDATDGRDYNASASTISPPSSMEEDKF 1210 

Qy 256 SEAS KEVSEK AKTLLIDRDLTE 277 

I : : : I : I I : : II I : I : 

Db 1211 SKSALRDAYRPEETDVKTGAELDIKDVSDERLSPAKSPSLSPSPPSPIEKTPLGERSV-N 1269 

Qy 278 FS ELEYSEMGSSFS-VSPKAESAVI VAN PRE EIIVKNK 314 

II I : : I I : : I I I I I : hi I I : : : : 

Db 1270 FS LTPNEI KASAEGEATAVVS PGVTQAVVEEHCAS PEEKTLEWS PSQS VTGSAGHTP YY 1329 

Qy 315 DEEEKLVSNNILHXQQELPTALTKLVKEDEWSSE — KAKD S FNEKRVAVE1APMRE 368 

II: : : I : I I I I : I I I I I : : : I I I 

Db 1330 QSPTDEKSSHLPTEVTENAQAVP VSFEFTEAKDE-NER — SSISPMDE 1374 

Qy 369 EYADFK-PFERVWE VKDSKED SDMLAAGGKIESNLESKVDKKCFADSL 415 

I : I I : I : : I I : I I I : I 1 I I I : I : I 

Db 1375 PVPDSESPIEKVLSPLRSPPLIGSESAYEDFLSADDKALGRRSESPFEGKNGKQGFSD — 1432 

Qy 416 EQTNHEKDSESSNDDTSFPSTPEGIKDRSGAYITCAPFNP AATESIATNI FPLLED- 471 

I : I I I I I : : I : I hi h I : : II 

Db 1433 KESPVS-DLTSDLYQDKQEEKRAGFIPIKEDFSPEKKASDAEIMSSQSALALDE 1485 

Qy 472 PTSENXTDEKKIEEKKAQIVTEKNTSTKTSNPFFVAAQDSETDYVTTDNLTK 523 

II:: : I : : I I h : I h h I : : 
Db 1486 RKLGGDGSPTQVDVSQFGSFKEDTKMSISEGTVSDKSATPVDEGAEDT YSHMEGVAS 1542 

Qy 524 VTEEWA — NMPEGLTPDLVQEACESELNEVTGTKIAYETKMDLVQTSEVMQES-LYPAA 58 0 

I : II : I I III : : h I :: : : I I I I h : h 

Db 1543 VSTASVATSSFPEPTTDD-VSPSLHAEVGSPHSTEVDDSLSVSWQTPTTFQETEMSPSK 1601 

Qy 581 QLCP S FEES EAT PSPVLPDIVMEAPLNSAV P 611 

: I I : : : I : : | : I : I I 

Db 1602 EECPRPMSISPPDFSPKTAKSRTPVQDHRSEQSSMSIEFGQESPEHSLAMDFSRQSPDHP 1661 

Qy 612 SAGASVIQ PSSSPLEASSVNYESIKHE — PENPPPY EEAMSVS-LKVSG 657 

: I I : : h I : h h I II I : I I I : : I 

Db 1662 TVGAGMLHITENGPTEVDYSPSDIQDSSLSHKIPPTEEPSYTQDNDLSELISVSQVEASP 1721 

Qy 658 IKEEIKEPENINAALQETE APYI SIACDLIKE TKLSAEP 696 

I I : I II : | |: | : :: | | | : 

Db 1722 STSSAHTPSQIASPLQEDTLSDWPPRDMSLYASLASEKVQSLEGEKLSPKSDISPLTPR 1781 

Qy 697 APDFSDYSEMAK VEQPVPDHSELVEDS 723 

: I I I I : I I : I I : : I : 



Db 1782 ESSPTYSPGFSDSTSGAKESTAAYQTSSSPPIDAAAAEPYGFRSSMLFDTMQHHLALSRD 1841 

Qy 724 SPDSEPVDLFSDD SIPDVP QKQD 746 

llllll: II : I : 

Db 1842 LTTSSVEKDNGGKTPGDFNYAYQKPESTTESPDEEDYDYESHEKTIQAHDVGGYYYEKTE 1901 

Qy 747 ETVML VKESLTETSFESMIEYENKEKLSALPPEGGKPYLESFKLSLDNTKD 7 97 

I : I I : I : I I I : I I I I I : : : 

Db 1902 RTIKSPCDSGYSYETIEKTTKTPEDGGYSCEITEKTTRTPEEGGYSY EISEK 1953 

Qy 7 98 TLLPDEVS — TLSKKEKIPLQMEELSTAVYSNDD LFISKEA 836 

I llllll: : : : : | : I I I : 

Db 1954 TTRTPEVSGYTYEKTERSRRLLDDISNGYDDTEDGGHTLGDCSYSYETTEKITSFPESES 2013 

Qy 837 QIRETETFSDSSP IEIIDEFPTLISSKTDSFSKLAREYT DL 877 

III: II I : : : : I I : III II 

Db 2014 YSYETTTKTTRSPDTSAYCYETMEKITKTPQASTYSYETSDRCYTPERKSPSEARQDVDL 2073 

Qy 87 8 EVSH-KSEIANA PDG AGSLPCTELPHDLSLKNIQP KV 913 

t I I : I : : : I : I I I I I : I : 

Db 2074 CLVSSCEFKHPKTELSPSFINPNPLEWFAGEEPTEESERPLTQSGGAPPPSGGKQQGRQC 2133 

Qy 914 EEKISFSDDFSKNGSATSKVLLLPPDVSALGHTQAEIESIVKPKVLEKEAEKK-LPSD-- 970 

: I I I II II: III | : | | : : | : | 

Db 2134 DETPPTSVSESAPSQTDSDV PPETE ECPSITADANLDSEDESETIPTDKT 2183 

Qy 971 TEKEDRSPS AI FSADLGKTSWDLLYWRDIKKTG 1004 

: I I I I I : I I I II III 

Db 2184 VTYKHMDPPPAPMQDRSPSPRHPDVSMVDPEALAIEQNLGKALKKDLKEKAKTKKPG 2240 



Search completed: September 29, 2004, 18:14:46 
Job time : 24.8874 sees 



